(12) INTERNATIONAL APPLICATION PUBLISHED UNDER THE PATENT COOPERATION TREATY (PCT) 



(19) World Intellectual Property Organization 
International Bureau 

(43) International Publication Date 
HJuly2002 (11.07.2002) 




II 



PCT 



(10) International Publication Number 

WO 02/054188 A2 



(51) International Patent Classification 7 : G06F 

(21) Internationa! Application Number: PCT/US02/0O466 

(22) International Filing Date: 7 January 2002 (07.01.2002) 

(25) Filing Language: English 

(26) Publication Language: English 



(30) Priority Data: 

09/755,623 



5 January 2001 (05.01.2001) US 



(63) Related by continuation (CON) or continuation-in-part 
(CIP) to earlier application: 

US 09/755,623 (CON) 

Filed on 5 January 2001 (05.01.2001) 

(71) Applicant (for all designated States except US): SYMYX 
TECHNOLOGIES, INC. [US/US]; 3100 Central Ex- 
pressway, Santa Clara, CA 95051 (US). 



(72) Inventor; and 

(75) Inventor/Applicant (for US only): DORSETT, Jr., 
David, R. [US/US]; 3281 Maryland Court, Pleasanton, 
CA 94588 (US). 

(74) Agents: PORTER, Timothy, A. et ah; Fish & Richardson 
P.C., Suite 500, 500 Arguello Street, Redwood City CA 
94063 (US). 

(81) Designated States (national): AE, AG, AL, AM, AT, AU, 
AZ, BA, BB, BG, BR, BY, BZ, CA, CH, CN, CO, CR, CU, 
CZ, DE, DK, DM, DZ, EC, EE, ES, FI, GB, GD, GE, GH, 
GM, HR, HU, ID, IL, IN, IS, JP, KE, KG, KP, KR, KZ, LC, 
LK, LR, LS, LT, LU, LV, MA, MD, MG, MK, MN, MW, 
MX, MZ, NO, NZ, PL, PT, RO, RU, SD, SE, SG, SI, SK, 
SL, TJ, TM, TR, TT, TZ, UA, UG, US, UZ, VN, YU, ZA, 
ZW. 

(84) Designated States (regional): ARIPO patent (GH, GM, 
KE, LS, MW, MZ, SD, SL, SZ, TZ, UG, ZM, ZW), 
Eurasian patent (AM, AZ, BY, KG, KZ, MD, RU, TJ, TM), 

[Continued on next page] 



(54) Title: LABORATORY DATABASE SYSTEM AND METHODS FOR COMBINATORIAL MATERIALS RESEARCH 

(57) Abstract: Systems, methods, and apparatus, including computer pro- 
gram apparatus, arc described for implementing techniques for processing 
data from a combinatorial experiment. The techniques include receiving data 
from a chemical experiment on a library of materials having a plurality of 
members and generating a representation of the chemical experiment. The 
representation includes data defining an experiment object having a plurality 
of properties derived from the chemical experiment. The experiment object 
is associated with the library of materials. The representation also includes 
data defining one more element objects. Each clement object is associated 
with one or more members of the library of materials. A data model and 
corresponding data structures for describing such experiments are also dis- 
closed. 



< 

00 



IT) 

© 

o 




320 



wo 02/054188 A2 i nil mi mi ii mm mil nil i nil] urn nil inn inn iim iin nun mi sii nn 



European patent (AT, BE, CH, CY, DE, DK, ES, FI, FR, 
GB, GR, IE, IT, LU, MC, NL, PT, SE, TR), OAP1 patent 
(BF, BJ, CF, CG, CI, CM, GA, GN, GQ, GW, ML, MR, 
NE, SN, TD, TG). 



For two-letter codes and other abbreviations, refer to the "Guid- 
ance Notes on Codes and Abbreviations" appearing at the begin- 
ning of each regular issue of the PCT Gazette. 



Published: 

— without international search report and to be republished 
upon receipt of that report 



WO 02/054188 



PCT/US02/00466 



LABORATORY DATABASE SYSTEM AND METHODS FOR 
COMBINATORIAL MATERIALS RESEARCH 

TECHNICAL FIELD 

This invention relates to database systems and methods for storing and 
manipulating experimental data. 

BACKGROUND 

The discovery of new materials with novel chemical and physical properties often 
leads to the development of new and useful technologies. Traditionally, the discovery and 
development of materials has predominantly been a trial and error process carried out by 
scientists who generate data one experiment at a time. This process suffers from low 
success rates, long time lines, and high costs, particularly as the desired materials increase 
in complexity* There is currently a tremendous amount of activity directed towards the 
discovery and optimization of materials, such as superconductors, zeolites, magnetic 
materials, phosphors, catalysts, thermoelectric materials, high and low dielectric materials 
and the like. Unfortunately, even though the chemistry of extended solids has been 
extensively explored, few general principles have emerged that allow one to predict with 
certainty the composition, structure and/or reaction pathways for the synthesis of such 
solid state materials. 

As a result, the discovery of new materials depends largely on the ability to 
synthesize and analyze large numbers of new materials. Given approximately 100 
elements in the periodic table that can be used to make compositions consisting of two or 
more elements, an incredibly large number of possible new compounds remain largely 
unexplored, especially when processing variables are considered. One approach to the 
preparation and analysis of such large numbers of compounds has been the application of 
combinatorial chemistry. 

In general, combinatorial chemistry refers to the approach of creating vast 
numbers of compounds by reacting a set of starting chemicals in all possible 
combinations. Since its introduction into the pharmaceutical industry in the late 1980's, it 
has dramatically sped up the drug discovery process and is now becoming a standard 
practice in that industry (Chem. Eng. News Feb. 12, 1996). More recently, combinatorial 
techniques have been successfully applied to the synthesis of inorganic materials (G. 
Briceno et al., SCIENCE 270, 273-275, 1995 and X. D. Xiang et al, SCIENCE 268, 
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1738-1740, 1995). By use of various surface deposition techniques, masking strategies, 
and processing conditions, it is now possible to generate hundreds to thousands of 
materials of distinct compositions per square inch. These materials include high T c 
superconductors, magnetoresistors, and phosphors. 

Using these techniques, it is now possible to create large libraries of diverse 
compounds or materials, including biomaterials, organics, inorganics, intermetallics, 
metal alloys, and ceramics, using a variety of sputtering, ablation, evaporation, and liquid 
dispensing systems as disclosed in U.S. Patents No. 5,959,297, 6,004,617 and 6,030,917, 
which are incorporated by reference herein. 

The generation of large numbers of new materials presents a significant challenge 
for conventional analytical techniques. By applying parallel or rapid serial screening 
techniques to these libraries of materials, however, combinatorial chemistry accelerates 
the speed of research, facilitates breakthroughs, and expands the amount of information 
available to researchers. Furthermore, the ability to observe the relationships between 
hundreds or thousands of materials in a short period of time enables scientists to make 
well-informed decisions in the discovery process and to find unexpected trends. High 
throughput screening techniques have been developed to facilitate this discovery process, 
as disclosed, for example, in U.S. Patents No. 5,959,297, 6,030,917 and 6,034,775, which 
are incorporated by reference herein. 

The vast quantities of data generated through the application of combinatorial 
and/or high throughput screening techniques can easily overwhelm conventional data 
acquisition, processing and management systems. Existing laboratory data management 
systems are ill-equipped to handle the large numbers of experiments required in 
combinatorial applications, and are not designed to rapidly acquire, process and store the 
large amount of data generated by such experiments, imposing significant limitations on 
throughput, both experimental and data processing, that stand in the way of the promised 
benefits of combinatorial techniques. 

Basing laboratory data management systems on current relational or object- 
oriented databases leads to significant limitations. Those based on relational systems 
struggle to provide a facility for effectively defining and processing data that is 
intrinsically hierarchical in nature. Those based on current object-oriented databases 
struggle to offer the processing throughput necessary and/or may lack the flexibility of 
recomposition of the internal data or direct access into the internal structures of the 
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objects as relational systems do with the relational view. Thus, there is a need for 
laboratory data management systems that combine the ability to process hierarchical data 
offered by object-oriented approaches and the processing power and/or flexibility of 
relational database systems. 

SUMMARY 

In general, in one aspect, the invention provides methods, apparatus, including 
computer program apparatus, and laboratory data management systems implementing 
techniques for processing data (including, e.g., receiving, manipulating and/or storing 
data) from chemical experimentation for or on a library of materials or a subset of such a 
library of materials. The techniques can include receiving data from a chemical 
experiment on a library of materials having a plurality of members and generating a 
representation of the chemical experiment The representation includes data defining an 
experiment object having a plurality of properties derived from the chemical experiment. 
The experiment object is associated with the library of materials. The representation also 
includes data defining one or more element objects, each of which is associated with one 
or more members of the library of materials. 

Particular implementations of the invention can include one or more of the 
following advantageous features. The chemical experiment can have a type that is one of 
a pre-defined set of one or more experiment types. The representation can implement a 
data model describing the set of experiment types, the data model including an 
experiment base class having a set of experiment base class properties including a 
classname property for identifying a derived experiment class and a library ID property 
for identifying a library of materials, and one or more derived experiment classes, each of 
which is associated with one of the experiment types and has a plurality of derived 
experiment class properties derived from the associated experiment type. The chemical 
experiment can be represented by a first experiment object instantiated from the derived 
experiment class associated with the type of the relevant chemical experiment, and by a 
second experiment object instantiated from the experiment base class, the classname 
property of the second experiment object having a value identifying the derived 
experiment class associated with the experiment type of the chemical experiment, and the 
library ID property of the second experiment object having a value identifying the library 
of materials. 
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library ID property of the second experiment object having a value identifying the library 
of materials. 

The representation can include data defining one or more data set objects or image 
objects. Data set objects can include sets of one or more values derived from the 
chemical experiment, each value being associated with one or more of the members of the 
library of materials. Image objects can include data representing a state of some or all of 
the members of the library of materials at a time during the chemical experiment. Data 
set objects and image objects can be associated with properties of an associated 
experiment object. The representation can include a self-describing representation of the 
chemical experiment, such as an XML string, and can also include a Java object, a COM 
DDL interface or a CORBA DDL interface describing the chemical experiment. 

The techniques can also include parsing the representation to map the data from 
the chemical experiment to tables in a relational database based on the properties of an 
associated experiment object. Parsing the representation can include identifying each of a 
plurality of XML entities in an XML stream, each entity having associated content; 
mapping each XML entity into a corresponding object property; and assigning the content 
associated with an XML entity to a database table based on the corresponding object 
property. Derived experiment class properties can include properties derived from 
parameters of an associated experiment type. Data set object values can be derived from 
or measured for parameters of the chemical experiment, and data set objects can be 
associated with experiment object properties derived from the corresponding experiment 
parameters. 

The techniques can also include storing the content in the assigned database table 
in the relational database, and the database can be searched to return a search result 
including data identifying a set of element objects satisfying search terms specified for 
one or more searchable fields. Search results can be stored as lists of element objects that 
satisfy the search terms of a query. Element object values can be displayed for one or 
more dispiayable fields. Object representations of chemical experiment data can be 
reconstructed from the database based on an object identifier specifying content to be 
retrieved from the database, from which content an object representation is generated 
based on a class name included in the specified content. The object representation can be 
mapped to an XML stream describing the content. 
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In general, in another aspect, the invention provides a data model for describing 
data from a set of pre-defined types of chemical experiments capable of being performed 
on a library of materials. The data model includes an experiment base class, one or more 
derived experiment classes, and an element class. The experiment base class has a set of 
experiment base class properties including a classname property for identifying a derived 
experiment class and a library ID property for identifying a library of materials. The 
derived experiment classes are associated with respective experiment types and have a 
plurality of derived experiment class properties derived from the associated experiment 
type. The element class has a plurality of element class properties including a position 
property for identifying one or more members of a library of materials and a value 
property for storing a value derived from a chemical experiment for the members 
identified by the position property. A specific experiment in the set of pre-defined 
experiments is represented by a first experiment object instantiated from the derived 
experiment class associated with the type of the chemical experiment, and by a second 
experiment object instantiated from the experiment base class. The classname property of 
the second experiment object has a value identifying the derived experiment class 
associated with the experiment type of the chemical experiment, while the library ID 
property of the second experiment object having a value identifying the library of 
materials. 

Advantages that can be seen in implementations of the invention include one or 
more of the following. Decoupling of client processes from the database isolates the 
experimental process from the underlying data storage, which means that client processes 
may be extended without requiring immediate extension of the database schema, and/or 
that the database is protected from unintended and/or unauthorized alteration by client 
processes. The database can be extended and remain backward compatible with existing 
client processes. Data is persisted in a human-readable format, aiding in error diagnosis. 
Using a common schema to store data describing different experiment types means that 
objects representing experimental data can be recomposed across technical disciplines. 

The details of one or more embodiments of the invention are set forth in the 
accompanying drawings and the description below. Other features, objects, and 
advantages of the invention will be apparent from the description and drawings, and from 
the claims. 
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DESCRIPTION OF DRAWINGS 

FIG. 1 is a block diagram illustrating a laboratory data management system 
including a database server process according to the invention. 

FIG. 2 is a flow diagram illustrating a method for storing data from an experiment. 

FIG. 3 illustrates a user interface for creating a library. 

FIGS. 4A-4B illustrate a portion of an XML document defining a data object 
representing an experimental data set and a relational database table into which such an 
object is mapped, respectively. 

FIGS. 5A-5B illustrate portions of an XML document defining a data object 
representing an instrumentation procedure for a materials-handling apparatus. 

FIG. 5C illustrates a relational database table into which the data object of FIGS. 
5 A-5B is mapped. 

FIGS. 6A-6D illustrate portions of an XML document defining a set of data 
objects representing a polymerization experiment. 

FIG. 6E illustrates a set of relational database tables into which the set of objects 
of FIGS. 6A-6D is mapped. 

FIG. 7 is a flow diagram illustrating a method for retrieving data from a database. 

FIGS. 8 A-8D illustrate user interfaces for constructing a query and viewing query 

results. 

FIGS. 9A-9E illustrate user interfaces for viewing data retrieved from a database 

according to the invention. 

Like reference symbols in the various drawings indicate like elements. 

DETAILED DESCRIPTION 

Figure 1 illustrates a laboratory data management system 100 that includes a 
general-purpose programmable digital computer system 110 of conventional construction 
including a memory 120 and a processor for running a database server process 130, and 
one or more client processes 140. As used in this specification, a client process is a 
process that uses services provided by another process, while a server process is a process 
that provides such services to clients. Client processes 140 can be implemented using 
conventional software development tools such as Microsoft® Visual Basic®, C++, and 
Java™, and laboratory data management system 100 is compatible with clients developed 
using such tools. In one implementation, database server process 130 and client 
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processes 140 are implemented as modules of a process control and data management 
program such as that described in U.S. Application Serial No. 09/550,549, filed April 14, 
2000, which is incorporated by reference herein. 

Optionally, client processes 140 include one or more of automated or semi- 
automated laboratory apparatuses 150, a user interface program 160 and/or a process 
manager 170 for controlling laboratory apparatus 150. Database server process 130 is 
coupled to a database 180 stored in memory 120. In one implementation, database server 
process 130 and client processes 140 are implemented as computer programs stored on 
computer-readable media, and can be distributed across multiple computers. Optionally, 
laboratory data management system 100 also includes input/output devices 190 and 
conventional communications hardware and software by which computer system 110 can 
optionally be connected to other computer systems by a computer network, such as a 
local area network, wide area network or the Internet. 

In general, laboratory data management system 100 receives data from client 140 
for storage, returns an identifier for the data, provides a way of retrieving the data based 
on the identifier, provides the ability to search the data based on the internal attribute 
values of the data, and the ability to retrieve data from the these queries in a number of 
different ways, generally in tabular (e.g., in a relational view) and object forms. 

Laboratory data management system 100 is configured to manage data generated 
during the course of the experiments performed by laboratory apparatus 150. An 
experiment is performed on a library of materials. As used in this specification, a library 
of materials is a matrix having two or more members, generally containing some variance 
in chemical or material composition, amount, reaction conditions, and/or processing 
conditions. A member, in turn, represents a single constituent, location, or position in a 
library containing one set of chemicals or materials subject to one set of reaction or 
processing conditions. While this specification uses specific types of experiments as 
examples, the particular materials, compounds or chemistries involved are not critical; 
instead, the methods, computer programs and systems described are broadly applicable to 
a wide variety of library types and chemistries. Thus, in particular implementations, 
libraries used in system 100 can include, for example, libraries of biomaterials, organics, 
organometallics, inorganics, intermetallics, metal alloys, or ceramics, and in particular 
heterogeneous and homogeneous catalysts, specialty application polymers and 
formulations of organic and inorganic materials including, for example mixtures of 
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polymers and/or bioactive materials with any other materials. Such libraries can be 
employed in experiments directed to the discovery of new and useful compositions, such 
as superconductors, zeolites, magnetic materials, phosphors, catalysts, thermoelectric 
materials, high and low dielectric materials, and other materials of interest. Experiments 
on such libraries can involve the measurement of a variety of properties, including 
without limitation electrical, thermal, mechanical, morphological, optical, magnetic, 
chemical, conductivity, super-conductivity, resistivity, thermal conductivity, anisotropy, 
hardness, crystallinity, optical transparency, magnetoresistance, permeability, frequency 
doubling, photoemission, coercivity, dielectric strength, or other useful properties that 
will be apparent to those of skill in the art, and can yield data in the form of , and can 
generate data in the form of, for example, electrical voltages and resistivity, photon 
counts, and frequency and wavelength of electromagnetic radiation, any of which could 
be measured on a time-varying basis or as a combination of each other in the form of 
dependent-independent variables. 

Libraries can include physical arrays of materials, with different materials located 
at different regions of a substrate. In one implementation, each library includes one or 
more members, each of which may be represented as a region in an arrangement (e.g., an 
array) of one or more regions. A library can include two or more members, preferably 
four, ten, twenty, or even ninety-six or more members. The members of a library may, 
but need not necessarily, correspond to locations on a physical substrate (such as a 
microtiter plate, wafer or the like) on which the library was or will be created. However, 
while the library may correspond to the geometry of the ultimate physical substrate, it 
may also represent a collection of library members on a more conceptual level. 
Conventionally, libraries may be depicted as square or rectangular arrays. However, 
libraries can be represented or prepared in any convenient shape, such as square, 
rectangle, circle, triangle or the like, and in one, two or three dimensions, depending, for 
example, on the underlying chemistry or apparatus involved. Details of library design 
and preparation are further discussed in co-pending U.S. Application No. 09/420,334, 
filed on October 18, 1999, and in U.S. Patent No. 5,776,359, U.S. Patent No. 5,959,297, 
U.S. Patent No. 5,985,356, U.S. Patent No. 6,030,917, U.S. Patent No. 6,034,775, and 
U.S. Patent No. 6,149,882, each of which is incorporated by reference herein. 

An experiment need not be performed on all members of a library, but may be 
performed on one member or a subset of members in the library. As used in this 
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specification, an experiment can include using instruments to create or manipulate 
samples and/or gather data, as well as processing (or reprocessing) data gathered during 
previous experiments. In one implementation, experimental data (whether measured 
directly or derived indirectly from measured data) can be represented in system 100 as a 
XYDataSet object, a data structure holding values measured or derived for one or more 
members of a library of materials - for example, an array having an element for each 
member in a corresponding library of materials (or a subset of those members), with each 
array element having a value corresponding to a measured or derived experimental value 
for the corresponding library member. Data can also be represented as an Image object, a 
two-dimensional, time-fixed data set representing the state of a library at a particular 
time, such as a spectrogram or photograph of a physical substrate embodying a library at 
a particular point in time. 

Laboratory apparatus 150 can include any automated or semi-automated device 
for handling or processing materials, gathering experimental data or analyzing chemical 
properties, such as synthesis or analysis robots, pumps, temperature or pressure 
controllers,- reactors, digital cameras for visible or infrared imaging, evaporative light 
scattering detectors, microcalorimetry arrays and the like. Exemplary apparatus 150 for 
material processing and screening are disclosed in U.S. Patent No. 5,776,359, U.S. Patent 
No. 5,959,297, U.S. Patent No. 5,985,356, U.S. Patent No. 6,030,917, U.S. Patent No. 
6,034,775, and U.S. Patent No. 6,149,882, previously incorporated by reference herein, as 
well as PCT publications WO 00/09255, WO 00/51720, and WO 00/36410, each of 
which is incorporated by reference herein. Optionally, apparatus 150 are controlled by an 
automated instrument control software application such as that described in U.S. 
Application Serial No. 09/305,830, filed on May 5, 1999, which is incorporated by 
reference herein. 

Database 180 stores experimental and analytical data derived from experiments 
performed by laboratory data management system 100. In one implementation, 
laboratory data management system 100 maintains three representations of each item of 
data: an object representation; a self-describing persistent representation and a 
representation based on relational tables. 

A method 200 for storing data from an experiment performed by laboratory 
apparatus 150 is illustrated in FIG. 2. The method starts when process manager 170 
receives from laboratory apparatus 150 a set of data derived from the experiment (step 
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210) - for example, a set of gel permeation chromatograms measured for each of the 
members in a library of polymers. Process manager 170 parses the data set (step 220), 
e.g., to associate individual data values with their corresponding library members, and 
packages the data for transmission to database server process 130, such as by generating 
an XML stream containing the data and associated mark-up identifying the type of data 
and its association with the members of the corresponding library of materials (step 230). 
Process manager 170 then sends database server process 130 a database access request 
including the packaged data (step 240). 

Database server process 130 parses the data, extracting for each object represented 
in the XML stream an object type and optionally one or more associated object properties 
(step 250), as will be described in more detail below. Database server process 130 uses 
the extracted object information to store (at least temporarily) each object in the stream in 
database 1 80 (step 260) - for example, by mapping each object to one or more database 
tables according to a mapping schema implemented in database server process 130. 

In some implementations, system 100 can support multiple schema for mapping 
experimental data (e.g., in XML form) into one or more alternate logical representations 
in relational form. The mapping can be partial - for example, loading a subset of the 
experimental data into relational tables. Alternate mappings can be used to support 
different applications of the data in relational form. For example, one mapping can be 
used to populate a specific set of tables with experimental data in such a way as to 
optimize a certain set of searching or analytical systems, including but not limited to the 
method 400 of retrieving data described herein. Additional mappings can be added to 
populate tables for the purpose of experimental documentation. Similarly, two equivalent 
mappings can be used to provide a means of populating a back-up database. 

In one implementation, client processes 140 interact with experimental data 
generated in system 100 through an object model representing experiments performed by 
system 100, illustrated in FIG. 3. In this object model, every experiment performed by 
system 100 is represented by an Experiment object 330 having a set of associated 
properties and methods that represent the experiment. Each Experiment object 330 is 
associated with a particular library of materials 300 - for example, by virtue of an 
associated LibrarylD 320 (here represented as a barcode), assigned when the library is 
created, that uniquely identifies the library in the universe of libraries defined for an 
instance of system 100. 
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The user registers a new library in system 100 by requesting a new LibrarylD for 
the new library. In response to the request, database server process 130 generates a 
LibrarylD for the new library, and defines a library geometry for the library - for 
example, by prompting the user to input the number of rows and columns in the new 

5 library. Optionally, the user can also specify values for one or more searchable fields for 
the library, including a library name, project name, notebook and page numbers, start and 
end dates, comments and keywords to be associated with the new library by making 
appropriate entries in fields provided by user interface program 160. The user can also 
provide information about the library to other users by specifying a status for the new 

10 library, such as "In progress", to specify, e.g., that synthesis of the library is underway, 
"Invalid" or "Questionable", to denote problems with the library, "Released", to specify 
that the library is available for use by other users of system 100, and "Archived", to 
reflect, e.g., that the library has been sent off-site for storage, to name just a few. Finally, 
the user can also associate one or more data files with the new library by specifying the 

15 desired files through user interface program 160. When it has received the necessary 
information, database server process 130 creates an Experiment object representing the 
preparation of the library — e.g., a Synthesis object, as described below. Optionally, 
database server process 130 can also create a separate object representing each new 
library (such as, for example, a Library object having properties specifying, e.g., 

20 LibrarylD, library geometry, the identity of materials making up the library and the like). 
After creating the appropriate data structure or structures to represent the new library, 
database server process 130 records identifying information in database 180 - for 
example, by updating a master concordance table in database 180 to associate the new 
library with a project identified by the specified project name. 

25 One implementation of a generic Experiment object 330 can include the following 

properties in addition to LibrarylD 320: ExperimentID, which is a number uniquely 
identifying the experiment within system 100; CreationDate, which is a date reflecting the 
date of the underlying experiment; CreatedBy, a string identifying the user who 
performed the experiment or created the Experiment object; Notebook, which is an 

30 integer identifying the laboratory notebook in which the experiment is recorded; 

Keywords, which are strings specifying searchable terms identifying the experiment; and 
the like. 
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Experiment subclasses (or derived Experiment classes) can be defined to 
correspond to one or more specific types of experiments capable of being performed by 
system 100. Thus, for example, a parallel pressure experiment can be represented by a 
derived PPRExperiment class that inherits the properties of the base Experiment class and 
has additional experiment-specific properties such as XYDataSet objects holding 
measured or derived values for pressure, temperature, conversion for each element or 
member of a library of materials. Similarly, a gel permeation chromatography 
experiment can be represented by a GPCExperiment subclass that, in addition to the 
properties of the base Experiment class, has additional experiment-specific properties 
such as a XYDataSet object storing chromatograms measured for each library element or 
member. In general, derived Experiment classes can be defined to describe any type of 
experiment capable of being performed on a library of materials by apparatus 150 

associated with system 100. 

In one implementation, each library 300 is represented by at least one Experiment 
object 330, including a Synthesis object (instantiated from a Synthesis subclass of the 
Experiment base class) reflecting the library's preparation. In addition to properties 
inherited from the Experiment base class, a Synthesis object has additional properties 
denoting the library's geometry (e.g., the number of rows and columns in the matrix 
representing the library of materials), and may also have properties corresponding to the 
methods used to prepare the library, such as data for reagents, processing conditions and 
the like. Additional experiments performed on the same library can result in multiple 
Experiment objects representing (e.g., referring to) the library. Optionally, experiments 
that result in reformulation or other alteration of the library (e.g., experiments that would 
result in a significantly different composition of matter and/or phase) can result in the 

creation of a new library. 

An Experiment object 330 can (but need not necessarily) also include a collection 
of Element objects 340 representing individual members 3 10 of library 300. Each 
Element object 340 represents a particular material in library 300 on which an experiment 
is to be or has been performed, and is characterized by a Position property, identifying the 
particular member 310 of the library, and a Value property containing a measured or 
derived data value for that library member. In one implementation, Element objects can 
be hierarchical children of a parent Experiment object, and can, for example, be 
referenced as individual children in a collection child attribute of the parent Experiment 
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object. However, Element objects (and the experimental data they contain) can also be 
retrieved and manipulated independently of their parent Experiment. 

Database 180 can also store other objects, including, for example, database 
queries and lists (described in more detail below), as well as a collection of pre-defined 
objects and object prototypes available for process manager 170 to use to control 
laboratory apparatus 150, as described in commonly-owned co-pending U.S. Application 
Serial No. 09/550,549, filed April 14, 2000, which is incorporated by reference herein. 

Each object has a set of properties that can include, e.g., object metadata, 
attributes and joins. Object metadata includes information defining the object class, such 
as an object description, an object type (e.g., Experiment, Element, Other), and a choice 
of a set of flags, such as Queryable, Updateable, Insertable, Common, and Retrievable in 
Recordset, that can be assigned to instances of the object class by a user. Object attributes 
include the set of properties that can be assigned values for any given instance of the 
object. Each attribute may be described, e.g., by name, description, the type of data the 
attribute stores (for example integer data, floating point, text strings, image, or x-y data), 
and other properties such as whether the data is user or system assigned, if it can be 
updated, if it can be retrieved in a tabular representation of the object type or only through 
the full object representation, and if the attribute should be presented to the user as having 
a vocabulary of permitted values, either specific fixed values or those from the current set 
of instances of the object stored. Finally, object joins specify the relationships between 
the object and other objects available to system 100, such as the names of any parent or 
child objects and attributes of those objects. 

Communication between client processes 140 and database server process 130 is 
provided through a persistent representation of data in a self-describing extensible format 
such as XML. As described above, client processes 140 receive or generate data derived 
from an experiment and package that data using known techniques in a format (e.g., an 
XML data stream) for communication to database server process 130. Upon receiving 
such a communication from a client process 140, database server process 130 parses the 
incoming data stream to extract object descriptions from the data stream. In one 
implementation, an object type corresponds to a top-level entity in the XML data stream, 
while sub-entites in the stream map to properties of the respective objects. For example, 
each top-level XML entity may have a name that corresponds to an object type defined in 
the metadata of database 1 80. The sub-entities of each top-level entity then map to 

13 



WO 02/054188 



PCT/US02/00466 



properties defined for the corresponding object, for example by sub-entity name. In this 
implementation, some entities may map to simple object properties, such as strings, longs, 
doubles, etc., while others may map to sub-objects or collections of sub-objects. Basic 
object properties that database server process 130 needs to store an object in database 180 
in step 260 may be included as attributes set out in the XML tag that includes the entity 
name. 

Database server process 130 maps the extracted experimental data into tables in 
relational database 1 80 (which can be implemented in any commercially available 
relational database, such as those offered by Oracle®, Sybase®, Informix®, IBM® or the 
like) according to a predetermined mapping schema. In one implementation, database 
server process 130 maps classes (e.g., the Experiment class and any subclasses, the 
Element class, etc.), to database tables, with each row representing an individual instance 
of the class or classes in the table (e.g., an experiment identified by a unique 
ExperimentID) and columns corresponding to class properties (e.g., for an experiment, 
the individual ExperimentID primary key, project name, experiment name, notebook 
number or the like, although not all properties need necessarily be saved in database 1 80). 

The following example illustrates the communication of an Experiment object 
from a client process 1 40 to database server process 1 30 to database 1 80. After 
performing a particular experiment, a client process 140 generates the following XML 
code: 

Experiment ID="0" ConCheck^O" PersistState="l" UserlD-" j smith"> 
<Project>test</Project> 
<Status>6</Status> 
<Notebook>l</Notebook> 
<Pages>l-5</Pages> 
<Flags>0</Flags> 
<Name>simple test</Name> 
<Keywords> 

<Keyword ID="0" ConCheck^O" PersistState^l" 

UserID="jsmith"> 

<Value>keyword 1</Value> 

</Keyword> 

<Keyword II>= u 0" ConCheck="0" PersistState^ u l" 

UserID="jsraith"> 

<Value>keyword 2</Value> 
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</Keyword> 
</Keywords> 
attachments /> 
<XML Attachments /> 
< /Experiment > 



By sending this XML stream to database server process 130 in step 240, process 
manager 170 requests database server process 130 to save a new Experiment object (a 
save request is indicated because the specified ID for the object is "0"; a request to 
modify or delete an existing Experiment object would include a previously-defined 
Experiment ID) for the user identified by UserlD "jsmith" The Experiment object to be 
saved has been assigned the name "simple test", is part of a project called "test", and is 
associated with pages 1 through 5 of notebook number 1 . The object has two Keyword 
sub-objects, "keyword 1" and "keyword 2". Although the Experiment object definition 
provides for the association of Attachment and XML Attachment sub-objects with each 
experiment, none are provided for the "simple test" experiment in this instance. 

* Assuming the user "jsmith" has appropriate system authorization, after parsing 
this object in step 250, database server process 130 assigns an Experiment ID and 
Keyword IDs and in step 260 stores the information as follows (in EXPERIMENT and 
KEYWORD tables) in database 1 80: 

EXPERIMENT 

ID PROJECT NAME NOTEBOOK PAGES 

999 test simple test 1 1-5 

KEYWORD 

ID EXPERIMENT ID VALUE 

222 999 keyword 1 

223 999 keyword 2 



As discussed above, experimental data can be parsed into multiple database 
representations. Thus, for example, database 1 80 can include a set of tables comprising a 
relational schema, with the tables being designed to record the progress of multiple 
different experiments performed on a library. For this purpose, each experiment can be 
stored as a separate row in a generic Experiment table, with columns for the date, library 
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ID, project, specific experiment type, etc. as described above. The experiment-specific 
data can then be stored in separate experiment-specific tables. 

The tables below illustrate a representation of the data in the relational database 
resulting from the processing of two screening experiments into a relational schema built 
for the above purposes: 



EXPERIMENT 

ID LIBRARYID PROJECT 

1 1 test 

2 2 test 



NAME NOTEBOOK PAGES 

screen 1 1 1-5 

screen 2 1 5-7 



SCREEN1 

EXPERIMENT ID TEMPERATURE RESULT 

1 40 2.0 

2 35 2.2 



SCREEN2 

EXPERIMENTID RESULT 

1 25.3 

2 35.4 



At the same time, system 100 can support a second a set of tables comprising a 
relational schema designed for the purpose of correlating a set of experimental facts 
across a specific library. For this purpose, the table design can include the library 
screening data directly within the main table, thus allowing for direct retrieval of 
screening data without the need for a query involving multiple tables. 

The table below shows a representation of the data in the relational database 
resulting from the processing of the two screening experiments discussed immediately 
above into a relational schema built for this purpose (some fields in the EXPERIMENT 
table have been omitted for clarity): 



EXPERIMENT 

ID LIBRARYID SCREEN1TEMP 
11 40 
2 2 35 



SCREEN1RESULT SCREEN2 RESULT 
2.0 25.3 
2.2 35.4 
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The process of parsing the intermediate representation and mapping that 
representation to tables in database 180 (steps 250 and 260 above) will now be described 
in more detail by reference to additional specific examples. In one implementation, 
database server process 130 extracts object information from an XML data stream and 
uses that information to create objects (e.g., Java objects) from the XML stream. The 
main external entry point for storing (insert or update) objects in database 180 can be 
implemented as a method public long Save (String xml), exemplified in Java 

code Listing 1, below. 

This method accepts a string containing the XML representation of the object and 
returns the resulting ED of the object (> 0) or an error code (<== 0). The overall process is 
to create and map the data from the XML document into objects (singular or plural, which 
may be implemented in known programming languages such as Java, Eiffel and/or the 
COM or CORBA interface definition languages), then store the contents of the objects 
using, e.g., a commercial Java class package, such as version 2.0 of the Visual Business 
Sight Framework (VBSF), available from Objectmatter, Inc. 

(http://www.objectmatter.com). Although this example is discussed in the context of Java 
objects and the Objectmatter class package, the particular programming language and/or 
class package used is not critical to the invention. Preferably, the XML stream is not 
exposed to any dependency or use of the class package capabilities, which therefore need 
not limit or influence the design and construction of XML documents representing 
objects. It is expected that the specific implementation of the storage of the objects could 
•change with no impact on the experimental object design, the XML document 
representation of the objects, the interface of database server process 130, or the use of a 
relational database for the final data storage. Thus, for example, database server process 
130 can be configured to map experimental data directly from the XML representation 
into database 180 (e.g., using known software packages for mapping XML documents 
into relational databases). 

The Save method loads the XML stream into an XML parser, which may be any 
commercial XML document object model (DOM) parser such as the Microsoft XML 
parser available with Internet Explorer 5.01, supplemented by wrappers implemented here 
as an additional separate class chentutiUties.XMLHelper and the interface 
clientutilities.IXMLEntity representing an individual entity in the XML document. Once 
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the root object of the XML stream has been obtained, the SaveXML method begins the 
process of constructing a Java object from the XML stream. 

If the request is to save an object with a non-zero ID, the XMLNodeToObject 
method that SaveXML calls creates a corresponding Java object and retrieves the 

5 specified object data from the database. If the object does not exist or the ID is 0, the 
XMLNodeToObject method simply returns the new Java object. SaveXML then 
compares the value of the ConCheck attribute in the incoming XML stream and that of 
the database. If these values are not equal, this indicates that the client is attempting to 
update an object using an out of date set of data and a "concurrency check" failure is 

1 0 returned. The ConCheck value is incremented with each successfully update of an object, 
reflecting its use as an object version number. 

Once the initial data or newly initialized Java object has been created, SaveXML 
processes each child XML entity of the object, attempting to map the XML node into an 
object attribute, a sub-object reference, or a collection of sub-objects. This is 

15 accomplished in the ReadXMLNodeO method discussed below. 

Once the XML entity nodes have been processed, if the object has a ClassName 
attribute, the SaveXML method stores the name of the Java class/XML object being 
saved into that attribute. This supports hierarchies of persistent objects where the storage 
is divided among several tables - that is, the storage is essentially a process of storing the 

20 base class attributes followed by storage of the derived class attributes, as in the example 
of the design of a PPRExperiment object to capture a polymerization reactor experiment 
using the Experiment object as a base class. This is discussed in more detail below. 

The ReadXMLNode method first determines if the requested operation is a 
deletion or a nullification of the node data and handles those special requests. It then 

25 attempts to find a best match for the XML entity node against the properties of the Java 
object being stored by examining the first sub-entity name. In the XML DOM, an entity 
with only data has a single sub-node with the name "#texf \ If this is found as a sub- 
node, this is interpreted as the simple case of a mapping of an XML node to directly to a 
Java class field, and the value of the XML node is used to set the value of the Java class 

30 field. 

If the XML entity node has something other than a simple text value, the matching 
Java class field is obtained as guidance for how to process the XML sub-entities. A test is 
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made to determine if the Java class field is a String field, and if so, the entire sub-entity 
stream is stored whole into a single Java field. 

This test permits the assignment of an arbitrary section of the XML document into 
a single Java field and subsequently a relational table column, permitting a short-cut 
whereby each XML entity is not required to be mapped into a specific relational table 
column. This may be advantageous where a section of the XML document is expected to 
change frequently as the result of iterations of software, instrument, or laboratory 
workflow development, adding or modifying the experimental data stream. This may also 
be useful where a section of the XML document does not represent data that would be 
commonly searched by attribute value, but where the section must be stored in whole for 
documentation purposes, such as the storage of an automated materials-handling 
procedure discussed below. 

If the Java field is a collection or reference attribute (e.g., from the Objectmatter 
class package), the XML entity node is processed as a collection of sub-objects or a 
contained single sub-object. This is done recursively to permit the arbitrary nesting of 
objects or collections of objects within other objects, which is important in the 
representation of intrinsically hierarchical data such as laboratory experimental data. 

In one implementation, it is useful (although not necessary to the invention) to use 
the Experiment object as a base class for all experiments. This provides several benefits, 
including: 

1 . The traditional object-oriented design benefit of improving reusability in the sense 
that this base class identifies the common attributes for all laboratory experiments. 

2. The ability to use the Experiment object itself to determine and represent the time- 
ordered sequence of all laboratory experiments related to a combinatorial library. 
The first benefit means that since the Experiment object contains attributes 

common to all experiments (such as library ID, date and time, notebook number and 
page, staff member, and keywords), it is possible to simply query the database for all 
experiments performed on a given library (for example) and immediately produce a list. 

The second benefit is more significant in the overall design of the system: since 
the actual name of the specific Experiment-derived class is stored using of the ClassName 
attribute in the SaveXML method as described above, it is possible to dynamically 
retrieve the specific experimental detail for any type of experiment by simply inspecting 
the value of the ExperimentClassName property retrieved and using that name in a 
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subsequent GetObject request. This permits the construction of universal user software 
that can view any experimental data records from the past, present and future definitions 
of experiment objects, 

FIG. 4A illustrates a portion of an XML document representing an XYDataSet 
object representing data from post catalyst injection measurements on a reactor in the 
Parallel Polymerization Reactor (PPR)™ system developed by Symyx Technologies, Inc. 
of Santa Clara, California. As discussed above, such objects can be used to represent data 
taken from a large variety of different sources, including electrical, temperature and 
pressure sensors and general analog/digital signals. The data, which is a set of IEEE 8 
byte floating point numbers, is encoded in the UTF-8 character set in the XML document 
in accordance with the rules on character sets permitted in XML documents. 

Database server process 130 first maps the XYDataSet object represented by the 
XML document of FIG. 4A into an instance of the Java class whose fields are defined as 
follows: 

. public class XYDataSet { 
public Long ID; 
public Long ConCheck; 
public String CreatedBy; 
public Date CreationDate; 
public String LastModif iedBy; 
public Date LastModif icationDate; 
public byte[] Data; 
public Long Flags; 
public String Title; 
public Long Line; 
public Long LineColor; 
public Long Point; 
public Long PointColor; 
public String XUnits; 
public Double XScale; 
public String XLegend; 
public String YUnits; 
public Double YScale; 
public String YLegend; 
public Long Status; 
public Long XColor; 
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public Long YColor; 
public Double SuggestMinX; 
public Double SuggestMaxX; 
public Double SuggestMinY; 
public Double SuggestMaxY; 
public Long PlotStyle; 
public Double XDataStart; 
public Double XDatalnterval; 
public byte[] YData; 
public byte[] XData; 

} 

Database server process 130 them maps this Java class into the Oracle table shown in 
FIG. 4B. 

FIGS. 5A-5B illustrate portions of an XML document defining an instrumentation 
procedure implemented for Impressionist™ materials-handling software available from 
Symyx Technologies, Inc. of Santa Clara, California. As described in U.S. Application 
Serial No. 09/305,830, filed on May 5, 1999, the Impressionist software can be used to 
control a large number of laboratory devices used in experimentation (synthesis and 
screening). To that end, Impressionist can use XML documents to save definitions of 
instrumentation resources, working substrates, and experimental procedures. Laboratory 
data management system 100 can then be used as a storage repository for all three of 
these XML documents. The example of FIGS. 5A-5B illustrate the processing of a 
procedure definition embodied in an Impressionist_Procedure_l object. 

One challenge presented in storing such procedures is that the nature of the object 
being stored can vary fundamentally with the actual instrument or instruments and 
experiment or experiments being performed. In particular, the presence or absence and 
ordering of entities in the XML document can represent the ordering of (and hierarchies 
in) the experimental procedure. Traditional static methods of mapping XML document 
entities into Java objects and/or relational database table columns are generally ill- 
equipped to handle such data, where mapping might require the explicit creation of a 
large number of storage instructions to account for an effectively infinite number of 
combinations of entities, and the ongoing addition of new instructions as new 
instrumentation is developed. 

Traditional database management systems might attempt to address these 
problems by storing only the entire document (thereby, e.g., equating the database and the 
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file system), but this, too, has disadvantages. For example, the XML documents (i.e., 
instrumentation procedures in this case) might not necessarily be readily identifiable 
through specific contained properties, thus passing responsibility for organizing the 
documents to client software (where, e.g., the client process may explicitly define storage 
of a procedure into a "project" or "computer"-specific repository, much like the process 
of choosing one disk folder over another for storage of a file). Laboratory data 
management system 100 addresses these problems by specifying a subset of the attributes 
from the instrumentation procedure to store explicitly. 

Thus, FIG. 5 A shows the root-level entities in the Impressionist procedure XML 
document, while FIG. 5B shows a portion of the fully expanded XML document 
including a portion of the definition of the experimental procedure. Those skilled in the 
art will note the heavy use of recursion in the procedure definition, which is common in 
laboratory control procedures. 

Based on this XML document, database server process 130 defines a 
corresponding Java object whose fields are defined as follows: 

public class Impressionist_Procedure_l { 
public Long ID; 
public Long ConCheck; 
public String Name; 
public Boolean Enabled; 
public Long Version; 
public String Children; . 
public String Author; 
public String Project; 
public Date CreationDate; 
public String Comments; 
public String UserFuncsAndSubs; 
public String Computer; 
public Boolean LoggingEnabled; 

} 

Database server process 130 maps this procedure to the table shown in FIG. 5C. 

The XML document to Java class field mapping method described in the 
ReadXMLNode dicussion permits the explicit extraction and storage of the Name, 
Author, Project, CreationDate, Comments, Computer and LoggingEnabled aspects of an 
Impressionist procedure and the aggregation of the Children (and UserFuncsAndSubs) 
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aspects of the Impressionist procedure into a single Java field and relational database 
table column. This permits the rapid organization of the database on explicit attributes 
and the flexibility of being able to store arbitrary procedure details. 

FIGS. 6A-6D illustrate portions of an XML document representing data from an 
experiment on a Parallel Polymerization Reactor™ available from Symyx Technologies, 
Inc. of Santa Clara, California. FIG. 6A shows a portion the XML document representing 
the data from the Experiment base class from a PPR experiment (derived ClassName 
PPRExperiment), including a set of 17 experimental logs collected as part of the 
experimental run, represented within the Logs collection sub-entity of Experiment, with 
each Log being sub-object of the collection. FIG. 6B shows a portion of the same 
document with data from Log 713 expanded. 

FIG. 6C shows the XML document for the PPRExperiment itself, including 
entities from the Experiment base class (with the Logs collection fully contracted), and 
several of the PPRElement objects that represent the data from the individual 
polymerization reactors. PPRElement 3456 is shown with all its data entities. In this 
implementation, PressurelD, TemperaturelD, Conversion©, PrePressurelD, 
PreTemperaturelD, and PreConversionID are data elements containing the ID number of 
XYDataSet objects, whose storage definition is discussed above. The TemperaturelD, 
PressurelD, PreTemperaturelD, and PrePressurelD XYDataSet objects are used to store 
the data traces from pressure and temperature sensors connected to the individual reactors 
on the device. The "Pre" designation is used to distinguish data collected before the 
catalyst is injected from data collected after injection. The ConversionID and 
PreConversionID XYDataSet objects represent the traces provided by a real-time 
calculation provided by the instrument software indicated the uptake of reactant gas in the 
reactor. 

FIG. 6D again shows the entities from the Experiment base class, the 
PPRElements collection fully contracted, and the PPRModules collection with 
PPRModule 293 expanded to show the PPRReactor objects and their entities. In addition, 
this portion of the XML document describes the reactor configuration, here a combination 
of configurations for modules of 8 reactors, where some settings such as temperature, stir 
speed, and pressure transducer settings are module-based, and some such as reaction 
quenching, are reactor-based. 
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This XML document, recording an experiment of 48 reactors/PPRElements 
configured as 6 PPRModules of 8 PPRReactors each, contains 1,634,500 bytes of data, 
exclusive of the 288 XYDataSet objects used to store traces of pressure, temperature, and 
conversion data both pre- and post-catalyst injection for each reactor/PPRElement 
(48*3*2). The total size of the XML docments required to represent an average PPR 
Experiment is approximately 40 Mb. 

The data from a PPRExperiment XML document is mapped to a number of Java 
objects: the base class Experiment, and its supporting sub-objects such as Keyword and 
Log; and the PPRExperiment itself, and its supporting sub-objects such as PPRElement, 
PPRModule and PPRReactor. Database server process 130 maps the data to Java classes 
as follows: 

public class Experiment { 
public Long ID; 
public Date CreationDate; 
public String CreatedBy; 
public Date LastModificationDate; 
public String LastModif iedBy; 
public Long Status; 
public String ClassName; 
public String Comment; 
public String Project; 
public Long Notebook; 
public String Pages; 
public Long RootExperimentID; 
public Long LibDesignID; 
public Long Flags ; 
public String Name; 
public Long Sequence; 
public OCollection Keywords; 
public OCollection Attachments; 
public Long ProtocolID; 
public Date Start Date; 
public Date EndDate; 
public OReference Equipment; 
public Long LibRows; 
public Long LibCols; 
public OCollection XMLAttachments; 
public Long ConCheck; 
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public String Log; 
public String Barcode; 
public String Type; 
public Long LibID; 
public OCollection Logs; 
public String Staff; 

} 

public class Keyword { 
public Long ID; 
public String Value; 
public OReference Experiment; 
public Long ConCheck; 

} 

public class Log { 
public Long ID; 
public Long ConCheck; 
public OReference Experiment; 
public Date StartTime; 
public String Name; 
public String LogData; 
public String Procedure; 
public String Resources; 
public String Substrates; 
public String RecipeData; 

} 

public class PPRExperiment extends Symyx. Experiment { 
public OCollection PPRElements; 
public OCollection PPRModules; 
public String StartupProc; 
public String ShutdownProc; 
public String Notes; 

} 

public class PPRElement { 
public Long ID; 
public Long ConCheck; 
public Long Position; 
public Long Flags; 
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public Long Status; 
public Long PressurelD; 
public Long TemperaturelD; 
public Long ConversionID; 
public Double FinalConversion; 
public Double FinalPressure; 
public Double FinalTemperature; 
public Long PreTemperaturelD; 
public Long PreConversionID; 
public Long PrePressurelD; 
public OReference Experiment; 
public Long LibID; 
public Long LibPosition; 

} 



public class PPRModule { 
public Long ID; 
public Long ConCheck; 
public OReference PPRExperiment; 
public Long LibID; 
public Double Temperature; 
public Double Tempera tureDeadband; 
public Double MaxTemperature; 
public Double StirSpeed; 
public Double Pressure; 
public Double PressureDeadband; 
public Double MaxReactionTime; 
public Boolean Enabled; 
public OCollection PPRReactors; 



public class PPRReactor { 
public Long ID; 
public Long ConCheck; 
public String QuenchMode; 
public Double QuenchValue; 
public OReference PPRModule; 

} 

Database server process 130 maps these classes to tables defined by the entity- 
relationship diagram is shown in FIG. 6E. 
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Database server process 130 retrieves data from database 180 using a method 
public String GetObject2(String objName, long ID, Boolean InclueBinaryData, Boolean 
UseBLOBFiles), exemplified in code Listing 2, below. 

This method accepts a string containing the name of the object to be retrieved, the 
ID number of the object, and 2 Boolean parameters that control retrieval of binary data 

The overall process is the reverse of the process of storing objects: retrieve the 
object by ID from the database and create a Java object by name and populate its fields 
(and sub-objects as necessary). This Java object is then mapped into an XML document 
and the document is returned to the requester as a string using the WriteXMLNode 
method. 

Client processes 140 (and users of those processes) see only the object 
representation of the underlying data and can interact with that data only through database 
server 130. In one implementation, client processes 140 interact with the underlying data 
through a proxy object server, such as a COM dll, configured to receive an XML 
representation of data from database server process 130 and construct an set of interfaces 
(consistent, e.g., with Microsoft's COM standard or the CORBA standard) that present a 
set of methods and properties representing a particular object or objects. Alternatively, 
client processes 140 can interact with the data directly through the XML representation — 
for example, by processing the XML document using a set of XSLT transformation rules 
to generate an HTML document which is then presented to the user in a Web browser as 
an Experiment write-up document. Because clients and users are isolated from the details 
of data storage in database 1 80, they can only manipulate data in ways explicitly 
permitted by database server process 130, which restricts their ability to retrieve or alter 
data without authorization. 

Client processes 140 can also evolve separately from the database 180. Thus, for 
example, if a client process 140 generating a new form of data (e.g., a "new" type of 
experiment for which no derived class has been defined in system 100) is added to system 
100 (or if an existing client process 140 is modified to generate a new form of data), the 
client process can simply embed the new data into the XML data stream sent to database 
server process 130, e.g., as an entity describing Experiment object (an instance of the 
Experiment base class) having a classname property identifying the new experiment type. 

Database server process 130 can be configured to map any new form of data (i.e., 
data that is not explicitly encompassed in the middle tier mapping schema) to default 
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storage such as an tc unknowns" column in database 1 80. Database 1 80 can be modified at 
a later date to handle this data explicitly. Alternatively, database server process 130 can 
be configured to generate an error message when it encounters an unrecognized form of 
data, notifying a database administrator of the presence of unrecognized data and 
providing a detailed explanation of the error (in the form of the XML stream) to enable 
the administrator to diagnose and fix the problem. Thus in the case of a "new experiment 
type" discussed above, upon encountering an unrecognized classname database server 
process 130 can be configured to create an instance of the Experiment base class 
describing the general characteristics of the experiment as discussed above, and to store 
the as-yet-unrecognized experimental detail embodied in the particular derived properties 
in generic storage, for later explicit treatment by a system administrator or the like. 
Alternatively, database server process 130 can be configured to generate new derived 
Experiment classes dynamically, using information contained within the XML 
representation as a framework for identifying and populating object properties, and/or 
mapping such properties to new or existing tables in database 1 80. 

A method 700 of using system 100 to retrieve data from database 180 is illustrated 
in FIG. 7 and 8A-B. The method begins in step 710, where, for example, the user selects 
File > New Query from a menu bar in a Queries window displayed by user interface 
program 160. Optionally, system 100 prompts the user to select a project or projects to 
search (step 720) - e.g., from a drop down list of projects defined in database 180. In 
response, system 100 establishes a workflow for searching stored data for the specified 
project or projects (step 730). The user can search by queryable fields defined for the 
specified projects, which may include, for example, library ID number, experiment 
number, experiment name, status, keyword or the like. The user can also search by 
particular types of experiments defined for the specified projects - for example, GPC 
Experiments for a gel permeation chromatography project or Synthesis Protocols for 
library synthesis. For each type of experiment, the user can select from project-specific 
fields defined in metadata for the experiment type - for example, GPC retention time, 
average polydispersity index, or the like for the gel permeation chromatography 
experiment referenced above. 

The user formulates a query by specifying a search domain (all experiments, 
experiment types, library elements, etc.) using, e.g., radio buttons 800 and/or drop down 
list 805 (step 740), selecting one or more search fields and a comparison operator in drop 
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down lists 810 and 815, and specifying a value to search against in Value box 820 (step 
750). The user can specify additional search terms by selecting an appropriate connector 
(e.g., and, or) in Combine box 825, in response to which system 100 causes user interface 
program 160 to display an additional row 830. Optionally, user interface program 160 is 
configured to indicate an incomplete or invalid query by identifying fields requiring 
specification by, for example, outlining or coloring the necessary fields in red. 

System 100 can represent queries as objects, XML strings and entries in relational 
tables in database 180. Queries are formulated as objects in user interface program 160. 
In response to a request to save or execute a search, user interface program 160 generates 
an XML string corresponding to the query object, which it transmits to database server 
process 130, Database server process 130 parses the XML stream and, to save the query 
in database 180, maps entities extracted from the XML stream to appropriate database 
tables as described above. The user can retrieve a saved query from database 180 by 
selecting a query from a list 840 of available queries previously stored in database 1 80, 
which may also identify, e.g., queries previously created during a particular session. 
Queries can also be exported to files for storage. 

The user runs the search by, e.g., selecting an appropriate menu item or button. 
User interface program 160 generates the appropriate XML string, which it sends to 
database server process 130 (step 760). Database server process 130 parses the data 
stream and generates appropriate commands in a data definition and/or manipulation 
language such as SQL, which it uses to search for responsive records in database 180 
(step 770). Database server process 130 searches the appropriate tables in database 180 
for any record that satisfies the specified search terms, and assembles the results into the 
form of a list object (step 780). The list object can subsequently be used by user interface 
program 160 to, for example, present the number of hits returned by the user's query, and 
it can be independently stored in database 1 80 for sharing with other users or for later use. 
Using the list, user interface program 160 requests a tabular representation of the object 
data for those objects on the list. The tabular representation capability of the server 
provides access to data for multiple objects in a compact form resembling a single 
relational table, and is useful for producing reports including selected data from many 
different objects or in presenting a list of objects from which the user might select one or 
more for full expansion. Furthermore, the tabular representation may be a more efficient 
means of retrieving data from a large number of objects as opposed to retrieving each 
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object via the XML document representation since it accomplishes the retrieval for all 
objects in a single operation and the data is "flattened" 

into a single table. However, the data available in the tabular representation may not 
include all the object data - specifically, it cannot be used to retrieve data from sub- 
objects or collections of sub-objects, since it is by definition a single table with each row 
representing a single object. For this reason, database server process 130 can also be 
configured to return results in an object form (e.g., as an XML stream), and clients 140 
can be configured to operate on data using both the object-XML representation for access 
to all object data and the tabular representation for access to data for a larger number of 
objects. In one implementation, the tabular representation may take the form of a 
Microsoft Active Data Object (ADO) Recordset object, where the data from the server is 
returned to the user interface client as an ADO Recordset object. Database server process 
130 returns the search result for display by user interface program 160 or use by other 
client process 140 (step 790). 

As discussed above, in one implementation database server process 130 formats 
the recordset as a list of elements that satisfy the terms of the query. Lists can be stored in 
database 180 and can be exported to files for storage. To store a list in database 180, the 
user selects File > Save List from a menu and selects the desired list from a set of 
available lists 850 (including, e.g., one or more lists generated by the user during a 
particular session). Optionally, user interface program 160 can prompt the user for list 
metadata, such as a flag indicating that the list should be public (i.e., made available to 
other users of system 100) or private, or a project name and/or description to be 
associated with the stored list. In one implementation, stored lists are static (i.e., database 
server process 130 does not update the list to include query-satisfying records added to 
database 1 80 after the underlying query was saved), while database server process 130 
updates the data content of records included in a stored list when the list is retrieved from 
database 1 80 or opened by user interface program 1 60. 

The user views a list by selecting the desired list from a set of available lists 
identified in Available Lists pane 850. Optionally, user interface program 160 provides a 
preview pane 860 showing list data for a current list - including, for example, columns 
corresponding to an element ID for each element satisfying the respective query terms, a 
library ID for the library containing the element, an experiment number, a position for the 
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element in the library, any flags set for the library, a library status, and experimental data 
such as average molecular weight. 

The user views data for a selected list by creating a Report (e.g., by selecting a 
menu item Report > Create Report). User interface program 160 and database server 
process 130 are configured to report experimental data in one or more formatted reports, 
such as spreadsheets, 2D or 3D plots, bubble plots, data grids and other known display 
formats. Optionally, database server process 130 can export recordsets for use in 
commercially available software application programs such as Spotfire, available from 
Spotfire, Inc. of Cambridge, Massachusetts. 

As shown in FIG. 9A, a spreadsheet 900 displays data for one or more selected 
fields fpr each element in a list. The user selects fields to display in a spreadsheet by, e.g., 
selecting the desired fields in a tree display 910 of available fields shown in FIG. 9B, 
which can include one or more displayable fields defined for the search domain specified 
as described above, such as average Mw, Mn, and PDI, data status or category, library ID, 
experiment ED or the like as shown in FIG. 9B. Displayable fields can also include 
images and data graphs, such as in chromatogram column 905 in FIG. 9 A. System 100 
generates a spreadsheet 900 having one or more rows corresponding to the elements 
satisfying the query, and one or more columns corresponding to the selected displayable 
fields. 

The displayed fields can, but need not necessarily, correspond to the queryable 
fields on which the database query was based, as described above. Rather, while 
formatting the data for display the user can select displayable fields in addition to or 
instead of the queryable fields used to construct the query that retrieved the data in the 
first instance. In addition to adding and removing displayable field columns from 
spreadsheet 900, the user can change the format of spreadsheet 900 by, e.g., changing 
fonts, column widths, data alignment and the like. The user can also merge field data that 
does not change across rows into single rows, move rows or columns in the spreadsheet, 
set minimum and maximum values for x and y coordinates of data graphs, and assign a 
status value to a row or rows in the spreadsheet. If database 1 80 includes multiple values 
for a particular field or fields for a given library element (e.g., values for multiple 
instances of a particular experiment performed on a particular library or library element), 
the user can choose to view all such values, or can choose to view the minimum, 
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maximum, or average of the stored values, or the first or most recent values stored in 
database 180. 

Similarly, the user can view the data as a two-dimensional plot, three dimensional 
plot or bubble plot, as shown in FIGS. 9C, 9D and 9E. Likewise, the user can also view 
data in the form of a data grid displaying experimental data (including graphs) as a grid of 
values for the rows and columns of each experiment (i.e., each library) in a selected list. 

A number of embodiments of the invention have been described. Nevertheless, it 
will be understood that various modifications may be made without departing from the 
spirit and scope of the invention. Accordingly, other embodiments are within the scope of 
the following claims. 
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Listing 1. 

* Score an object in the database. 

* Qparam xml XML stream representation of the object. 

* ©return the ID of the object stored (>« 2) or an error.: 

* 0 general error 

* -l unable to retrieve new object iD 

* -2 update conflict: ConCheck value in the database is not the same as the 

incoming object's 

* -3 invalid class 

* -4 mapping error 

* -5 database error (including db integrity check failure) 

* •£ exception in inner object save 

* -7 XML/obj schema mismatch: unknown object or attribute 

* -8 user ID not provided or not authenticated 

* -9 internal error 

V 

public long Save (String , xml) 

Log . setFile ( "Save" ) j 
if LlogLevel > 1) 

Log. Log ("TO", "Entering Save", 0)? 
try 

// parse the XML 

clientutilities.XMlHelper helper * new clientutilitiee.XMLHelper () / 
clientutilities. IXMLBntity docKode - helper -LoadDocument (ami) t 
return SaveXML (docNode) ? 

J 

catch (Exception ignore) 
{ 

Log.LogCE", "unable to load XML document", 1181) t 

finally 

if (jlogLevel > 1> 

Log.Log("TJ2" , "Exiting Save", 0); 

return -9j 

) 

private long SaveXML (clientutilities . IXMLBntity docNode) 

Log. setFile ("SaveXML"); 
if (JtogLevel > 1) 

Log.Log("U2", "Entering SaveXML", 0) ; 
long ret code » 0; 
try 
{ 

// parse the XML 

clientutilities. IXMLBntity rootNode » docNode.FiretChildO j 

// extract root node name, make into database object 

boolean exist ingObject - false j 

Object rootcbj - XMLNodeTc>Object (rootNode) ; 

if (rootObj null) 

{ 

Log. Log ("B", n Conflict in concurrency check value updating " + 

rootNode. getBntityNameO, 807); 
return -2y 

} 
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// get relevant object- level attributes 

1C (( (Long) (rootObj. getClassO .getFieldC'ConCfafiCk*) .get (rootObj))) ,longvalue( 
) > 0) 

exietingObject ■ true; 
else 

exietingObject = false; 
int. persistState « rootNode . Getl«ongAttribute.( "PersistState 0 ) ; 

// validate user and lode this session 

String user ID ° rootNode . GetSt ringAttribute ( "TJserlD" ) ; 

if {user ID null) 

{ 

I,og.Log("B D , "No userlD to update ■ + rootNode. getBntity»axne{) , 1517)? 
return -8; 

} 

// map each XML entity under the root into a field 
clientutilities.lXMLEntity node a rootNode. PirstChildO ? 
while (node 1- null) 

retCode « ReadXMLNode (node, rootObj, user ID) ? 
if (retCode < 0) 

^ Log-Log ("W", "UbdateO unable to read XML node: " + 

node.getEntityNameO + " of root entity » ♦ rootNode. getBntityNameO 

+ ° for user 0 + user ID, 668) ; 

return ret Code; 

else if (retOode > 0) 

^ // update the root object persist state based on sub- object processing 
persistState* (int)retGode; 

node • node .KextSibling ( ) / 

} 

// see if the root object has a "dassName" field- if so, populate it with 

the class name 
try 

^ Field typeFld * rootObj .getdassO lgetField("Classllarae") ; 
if (typeFld I- null) 

^ String typeName « 

Cb j ectNameFromdassName (rootObj . getClaaa ( ) . getKame ( ) ) ; . 
typeFld. set (rootObj, typeName) j 

■ } 

catch (Exception ignore) 

{ 
} 

// update a root object 

long newID - SaveGbject (rootObj , user ID, persistState); 

Log. Log ( ■01"/ "Saved « ♦ rootNode. getBntityNameO + n ID - " ♦ newID r 0); 

rootObj - null; 

return newID? 

} 

catch (Exception e) 
{ 
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Log.IiogCE", "Exception: ° ♦ e.toStringO , 738),* 
return 0; 

} 

finally 

// unlock session before returning 
AuthorizeUser ( ■ » ) ; 
if (JLogLevel > X) 

Log. Log ("TO 0 , »Exiting SaveXML", 0); 

) 

} 

// return < 0 error code, > 0 subobject persist state change 

private long ReadMJtode(clientutilities.IXMIiBntity node, java.lang. Object obj. 
String UserlD) 

Log . setFile { "ReadXMLKode" ) i 
try 

{ . . - 

if (node l« null) 

{ 

String nodeName » node . getEntityName ( ) ; 
Class obj Cla « obj .getClassO ; 

clientutilities.IXMLEntity subNode » node.FirstChildO ; 
// delete the object 

if (node. GetLongftt tribute ("PersistSt ate") « StateFlag.stateDeleted) 

_db . delete (obj ) ; 

return StateFlag.stateDeleted; 

} 

// delete the data (different from deleting the object!) 
else if (node . GetLongAttribute ( "Null ") — 1) 

^ Field fid - objCls .getField (nodeName) ; 
fid. set (obj, null); 
db.narkOpdate (obj ) ; 

} " 

// BLOB data nodes 

else if (node.QetStringAttribute(»File°) 1- null) 
{ 

Field fid - obj CI s . getField (nodeName) ? 

java.lang. Object oVal - EntityToFieldObjectValue (node r fid); 
fid. set (obj, oValJj 

) 

// otherwise we 1 re updating or inserting 
// skip entirely empty collection nodes 
else if (subNode l« null) 

String subnodeName « subNode . getEntityName ( ) ; 

// leaf nodes have data to extract 

if (subnodeName. equalsIgnoreCase("#text") ) 

Field fid = obj Cls. getField (nodeName) ; 

java.lang. Object oVal » EntityToFieldObjectValue (node, fid); 
fid, set (obj, oVal); 

} 

// subparents have references/collections to extract recursively 
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// subparent object reference- subnodes or the subparent have values: 
// <node><B>#text</B></node> * 

If subparent object collection: BUbnodes of the subparent do not have 
data: 

// <node><Bubnode><B>#text</B></subnode></node> 
else 

^ // get the field from the object 

Field collFld «= objdB.getField(nodeName) ; 

// if we have a string field, then store the entire XML node stream 
as a value 

if (collFld.getTypeO .getNameO . equalBlgnoreCase ( ■ j ava . lang . String" ) ) 

^ // 09-05-00 DD fix to store node XML, not subnode! 
java. lang. String nodeXML - ■<?xml version-'l.O 1 ?>° + 

node . GetXML ( false) ; 
collFld.set(obj, nodeXHL) ; 

} 

else 

^ // get the actual field instance (NOT the definition) 
Object collObj » collFld.get(ebj) ; 
String fldType ■ collObj.getClassO •getNameO ; 

// collection attributes 

if (fldType . equals ignoreCase ( 0 com . ob j ectmatter . bsf . OCollection" ) ) 
{ 

// try to create (or retrieve) the first subobject from the 
collection 

com . ob j ectmatter . bsf . OCollection collData » 

(com . ob j ec tmatter .bsf. OCollect ion) (collQbj) ; 
Object subObj - X^odeToOollectionObject (collData, subNode); 
boolean existingGbject « false; 

// otherwise loop over all subNode (objects) 

while (subNode l» null) 

{ 

// existing or new? 
if 

(((Long) (subObj.getClaBBO .getField(»ConCheox«') .get(subObj) ) ) .1 
outvalue () > 0) 
exiBtingObject ■ true; . 
else 

existingObject - false; 
// delete the object? 

if (8UbNode.GetLongAttritoute( ,, PerBistState ,, ) — 
StateFlag. stateDeleted) 

{ 

_db. delete (subObj) ; 

} 

// or insert/update 
else 

// loop over all attribute values 

dientutilities. IXMLBntity subsubNode » subNode . First Child ( ) ; 

while (subsubNode 1= null) 

{ 

String subsubNodeName ■ eubsubNode .getEntityName ( ) j 
long retCode «= ReadXMLNode ( subsubNode , subObj, UserlD) ; 
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// return immediate error 
if (retCode < 0) 

Log.Log( u E n l "Unable to read subNode * + subsubNodeName, 

sas) » 

return retOode; 

} 

subsubNode a BubsubNode . NextS ibl ing ( ) ; 

} 

if ( exist ingObj ect mm true) 

__db. marklTpdate (subobj ) ; 
else 

collData . add (subObj ) ; 



// get next another subobj ect 
eubNode - BubNode . NextS ibl ing ( ) ; 
if (eubNode I* null) 

BubObj = XMLNodeToCollectionObj ect (collData , eubNode) ; 



// this entity maps to a referenced object: 'node* is a subobj ect 

and the 
else if • 

(f ldType . equalslguoreCase ( 0 com . obj ectmatt er .bsf .GRef erence") ) 

com.objectinatter.bsf .©Reference refObj - 

(com, ob j ect matter .bsf . ORef erence) col 10b j ; 
boolean exist ingObj ect » false; 

// create (or retrieve) the object 

Object subObj - null; 

try 

// get node name and try to make a database object 

com. objectmatter. bsf .mapping. schema. ClassSchema echema * 

_db . getClassSchema (obj , getClaas ( ) : getName ()) ; 
String obj Name * 

schema .getAttribute (nodeName) . getRef classNarae () ; 
long ID m node. GetLongAt tribute ("ID") ; 

// this is an update (existing object) 
if (ID > 0) 

{ 

subObj - _db. lookup (obj Name, ID); 

// did not find object in database, use ZD and insert 
if (subObj »= null) 

subObj « _db. create (obj Name ) ; 

Field fid • subCbj.getaass().get?ield(*ID*)f 

fid. set (subObj , new Long (XD) ) ; 

fid «= subObj .getdassO .getPield("0cnCheck«) ; 

fid. set (subObj, new Long(O) ); 



// found object-ID in database, do a version check 
else 

{ 

Class subObjcis e subCbj.geta.ass(); 
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// check concurrency value of XML against the database 
long conValue » nc^e.GetI<mgAttribute("C3onCheck ,, > ; 
long dbConValue ° 

((Long) (subObjcaB.getPield( n ConChecfe").get(subObj))) .loncfVa 
lue(); 

if (conValue 1- dbConValue) 

{ w fc , 

Log.Log( n K» f "Conflict in concurrency check value 

updating " + node.getEntityNaioeO , 1721); 

return -2j 

} 

^ existingObject ■ true; 



} 



// this is an insert (new object) 
else 

< 

subObj - _db. create (objName) ; 

Field fid « subObj .getClaeBO.getPieldriD")? 

String newID m 

AllocateQbj ectID (Ob j e ctNameProroC3.asaName (objKarae) ) ; 

fid. set (8Ub0bj new Long (newID) ) ; 
fid « subObj.getaassO ,getPield( u OonC!heck") ; 
fid. set (subObj, new Long(0) ) / 
existingObject ■ false} 

} 

catch (Exception e) 

Log. Log ("B", "Unable to create subobject • ♦ nodeNaroe, 1742); 
return -7; 

> 

if (subObj lm mill) 
{ 

int persistState - node.QetLcmgAttribute ("PereiatState") ; 
// loop over all subobject values (attributes) 
. While (subHode 1- null) 

{ . 

long retCode - ReadXMUIode (subNode, subObj, DeerlD) ; 
if (retCode < 0) 

return retCode; 
subNode « subNode. NextSibling () ; 

} 

//if the referenced object is not owned, then update it if it 

doesn't have an ID already 
// in this case, the referenced object MOST be a root object 

in terms of attributes J 

if 

(_db • getClaesSchema (obj .getClassO .getNaroeO ) .getAt tribute (no 
deNaroe) . isOwnership ( ) false && 
refObj . isOontainedO false) 

{ 

if (AuthorizeUser(UserXD) -« false) . 
{ 

Log. log ("E" , "Unauthorised user updating " + 
subObj .getClass () .getName () , 827) ; 
return -8; 

// update a root object 
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long eubObjlD - UpdateRootObj ect ( subOb j , exist ingObject, 
persistState) ; 

if (eubObjlD < 0) return subObjID? 

// add a reference to this object to the collection if not 
existing 

// the actual field instance (NOT the definition) 
com. ob j ectmat t er . bsf . (Reference refData «= 

(com- obj ectmatter , bs f . ORef erence) (ob j ds . getPield (nodeKatne 

).get(obj)); 
refData . set ( subOb j ) ; 

// if the object wasn't able to be created, try and. see if we 

have a field in which to store the 
// entire subobj as a value 
else 

// only if this is a string field! 
coll Fid m obj C1b . getPield (nodeName) ; 
if (collFld I- null && 

collFld.getTypeO .getNaraaO . equal BlgnoreCase ( " j ava . lang . S tr 

ing")> 

j ava. lang. String subnodeXML - node .GetXML (false) ; 
collFld.8et(obj, subnodeXML)? 

} 

else 

// not able to map an XML entity to any object attribute. 
Log. Log CE" t "UHable to map " + nodeName + ■ entity to 

attribute or subject 1793); 
return -7; 

} 

return StatePlag . stateChanged; 

Log. Log ("B", "Exception: ■ ♦ e.toStringO , 933); 

} 

finally 

^ // clear userlD (from any subobj ect inserts) 
AuthorizeTJser ( ■ • ) ; 

} 

return -6/ 

} 

private j ava. lang. Obj ect EntityToPieldt&jectValue(clientutilities.IXiaEntity 
node, j ava .lang. reflect. Field fid) 

^ lK5g.BetPUe("EatityToPieldObjectValue") ; 
if {node «■ null | ] fid »« null) 

return null ; 
try 

{ 
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String fldType = fld.getTypeO .getNameO ; 

if (fldType.egualsIgnoreCase( n java.lang. Boolean") ) 

^ if (node.getLoncfValueO — 0) 

return new java. lang. Boolean (false) ; 

else 

return new java. lang. Boolean (true) ; 
else if (fldType.equalsIgnore^^ 

* return new java .lang. Double (node. getDoubleValue ()) ? 
else if (fldType.equalsIgnoreCase(»java.lang.Long n )) 

* return new javaaang .Long (node. getLongValuelH ; 
else if (fldType.eq^alsIgnoreCase(»java.lang.String»)) 
^ return new java. lang. String (node, getstring^alue ()) ; 
else if ( f ldType . equalsIgnoreCase ( » java.util .Date" ) ) 

^ return new java. util. Date (node. getStringfValueO ) t 
else if (fldType.eo^alsIgnoreCase( 8 tB B )) 

^ String fileName = node. GetStringllttritoute ("File") , msg? 
com.ms.com.SafeArray sa » null; 
if (filename 1- null && fileName . length ( ) > 0) 

{ clientutilities.FileHelper fh « new <=l^^ tiU ^ es ^ e ^^J^, , 
Sm.ms.com. Variant var - fh.ReadFile (fileName, FileType.f ileBinary) > 
if (var.getvtO «« com. ms.com. Variant .VariantArray + 
com.ms . com.Variant . VariantByte) 

^ sa - var.toSafeArrayO; 
msg ° "BLOB read from file" ; 

} 

else 

^ msg - "Unable to read BLOB from file"; 
} 

} 

else 

sa ■ node.get3inaryValue(cxmi.ms.com.Vari * 
jobs « "BLOB read from XML stream"? 

if (sa.getvtO — com. ins. com. Variant .VariantByte) 

int neleme « Ba.getOBoundO - sa.getlfioundO + 1? 
if (_logLevel > l) 

Log.Log("Ul", msg ♦ size " + nelems, 9B9) ; 
byte II ba « new byte [nelems] t 
0a.getBytes(O v nelems, ba r 0); 
if (fileName l« null && fileName. length 0 > 0) 

' java. io. File f « new java. io. Pile (fileName) ; 
f. delete (); 

return baj 

} 
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else 

^ Log,Log("W", "Uixmappable field typo " + fldType, 1015); 

>' 

catch (Exception e) 

^ Log.LogCE". "Exception i « ♦ e.toStringO , 1009) i 
} 

return null; 
> 



41 



WO 02/054188 PCT/US02/00466 



Listing^ 

* Retrieve a single object in XML form from the database. 

* ©param objName name of the object 

* ©param ID ID of the object 

* ©param IncIudeBlnaryData True to include binary data in the returned 
objects, False to include 

* data only from non-BLOB atttributes 

* oparam DseBLOBPiles True to transfer BLOB data through a file share, 
False to include the BLOB data 

* in the XML stream (only used if IncludeBinary Data is True) 

* ©return XML stream containing the object attribute data, empty string 
if an error occurs 

*/ 

public java.lang. String GetObject2( String objName, long ID, boolean 
IncIudeBlnaryData, boolean UseBLOBFiles) 

' Log.BetFUe( a GetObject2") ; 
if (_logLevel > 1) 

Log.Log("U3 H , "Entering GetObject2", 0); 
if (_db «• null) return »"; 
try 

^ String clsName - ClassNameFrcmObjectName (objHame) ; 
Class objds - Class. forName (clsName) ; 
if (objds ~ null) 
{ 

Log.Log{"W n , "No known objects " + clsKame, 567) r 
return ""j 

} 

if ( lo^Level > 1) 
Log.Log("U3 n , "Object + clsName + ". ID » " + ID + " retrieve", 

0); u % * 

java.lang.Object obj - _db. lookup (objCle, ID) ; 

if (obj — null) 

Log.Log("W", "Object not found: ■ + clsKame + ", ID « " ♦ ID, 573); 
return 99 t 

objClo - obj .getClassO ; 
if (_logLevel > l) 

Log«Log("D3", "Object " + clsName + ", ID » ■ + ID •+ • found", 0) i 

// make it into XML 

clientutilities. XMUIelper helper - new clientutilitieB .XMLHelper () / 

if (helper null) 

{ 

Log.Log("B", "Unable to create XMLHelper instance", 582) ; 
return 

) 

clientutilities . IXMLEntity docNode - helper . CreateDocument ( ■ " , " 0 ) t 
clientutilities . iXMLEntity rootNode = dccNOde.MdEntity(objName) ? 
if (rootNode — null) 

Log.Log("B", "Unable to create XML document 11 , 590); 
return 

WriteXMLNode (rootNode, obj, IncIudeBlnaryData , UseBLOBFiles);. 
if (JtogLevel > l) 
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Log. Log ("U3" , "Object ■ ♦ clsName + ", ID a n + ID + 9 XML object 

prepared" v 0); 
String ami o rootNode , GetXML ( true) ; 

if (J.ocfLevel > 1) 

Log.Log("U3°, "Object " + clsName + ° # ID • 0 + ID + a XML stream 
complete", 0); 

Log. Log ("Ul", «<3etObject2 returned ■ + objName + " " + ID, 0); 
// update counter 

cora.ms . wfc . app .RegietryKey regKey « 

com. ms . wf c . app . Regi s try . I£CAL_MACHINE • getSubKey ( " Sof twaxe\\Symyx 

Technologies \\SymyxDb n / false) ; 
if (regKey !■ null) 

^ Long tot - new Long( ( (String) regKey .getValue ( "ObjectsReturned" ) ) ) ; 
long total <=> tot.longValue() ♦ 1; 

regKey.setValue("CBtf ectsReturned", new Long (total) .toStxingO) r 
regKey* close () ; 

} 

return ami; 

catch (BQDBBxceptian e) 

Log.Log("E", "BODBBxceptiou: • ♦ e.toStringO, 600); 
return B0 ; 

catch (Exception e) 

Log.Log("B", "Exception: " + e.toStringO , 605); 
return ""; 

finally 

if (_lo$Level > 1) 

Log.Log("U3 n , "Exiting GetObject2", 0) ; 

} 

// used to keep list of current parents during recursive processing 
private static java.util. Stack parentNodeList - new java.util .Stack () ; 
private boolean VferiteXMUIode (clientutilities . IXMLKntity node, 
java.laug. Object obj, boolean includeftttachmente, boolean UseBIOBPilos) 

^ Log.setFile( w WriteXMLNode") ; 
try 

I 

Field fid; 

Class objCls « obj ,getClaos() ; 

// track objects by object class name and haBhcode 
parentNodeList . addElement (cibj .toStringi) ) ; 

// node attributes 

fid m objdB.getField(»ID"); 

node .AddLongAttribute ("ID" r ( (Long) fid. get (obj ) ) . intValue ( ) ) ; 
fid - objCLs.getPield(»ConCheck"); 

node .AddLongAttribute ("CanCbeck", ( (Long) fid. get (obj) ) .intValue () ) ; 
node .AddLongAttribute ( "Persia tState" , StateFlag. stateSaved) ; 

// now write public entity field values 
Field fids [] «= objCls.getFieldsO; 
for (int i « 0; i < fids. length; ++i) 
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{ 

fid e fids [i] ; 

if (Modifier.iePublictfld.getModifiereO)) 

String fldName «* fld.getNatne() ; 
if (fldName.equalBlgnoreCaBe("ID n ) — false && 
fldName . equalsIgnoreCase ( * ConCheck" ) false) 

String f IdType fld.getTypeO .getNameO ; 
if . 

(f IdType . equalsIgnoreCase ("cora.objectmatter .baf .OCol lection") 
{ ) 

Object tJ val « ( (OOol lection) fld.get(obj) ) -get () j 

if (val J- null) 

{ 

// create the overlying entity 
clientutilities. IXMLBntity subNode « 

node . AddEntity (fldName) ; 
// write each subentity 
for (int j - Oj j < val. length; ++j> 

Object ele « val [j] ; . 
String eleName - 

ObjectNamePromClaBcName(ele.getClass () .getName () ) ; 
clientutilities. IXMLBntity eleNode « 
subNode .AddEntity (eleName) t 

WriteXKLNode {eleNode, ele, InclUdeAfctachmenta , 
UseBLOBFilee) ; 

else if 

(f IdType . equalsIgnoreCase ( "com.objectmatter .bsf .ORef erence* ) ) 

// only write child references, not parents 
com. objectmatter.bsf . ORef erence refVal «= 
(com.objectmatter .bsf .ORef erence) fid. get (obj ) ; 
Object val - refVal.get () ; 

if (val I* null && parentNodeLiet. contains (val. toStringO ) ■ — 
false) 

{ 

clientutilities. IXMLBntity subNode ■ 

node . AddEntity ( fldName) ; 
// need to pop this object off the parent list to write it 
//parentNodeLiet .removeBlement (obj .t ©String () ) t 
WriteXMLNode (subNode, val, IncludeAttachments , 

UseBLOBPiles) ; 

>' 

else if (fldType.eoual8lgnoreCase(*java.lang.Bool.ean M )) 

Object val * fid. get (obj) ; 
if (val U null) 

{ 

clientutilities. IXMLBntity eleNode « 

node. AddEntity (fldName) ; 
if (((Boolean) (val)). booleanValueO true) 

eleNode . eetLongValue ( -1) ; 
else 

eleNode .BetLangValue(O) ; 
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else if <fldType.equaleIgnoreCase{"java.lang.Double°) ) 

* ( 

Object val « fld.get (obj) ; 

if (val 1- null) 

{ 

clientutilities . rXMT.Kntity eleNode * 

node.AddBntlty (fldNarce) ; 
eleNode . BetDoubleValue ( ( (Double) (val) ) .double Value () ) ; 

) } 

else if (£ldType.equal8lgm>xeCase(*java.laxi9.1jaxig 0 )) 

Object val c fld.get (obj) ; 
if (val I- null) 

clientutilities. IXMLEntity eleNode « 
node.AddEntity (f ldNaroe) t 
^ eleNode. eetLongValue ( ( (Long) (val) ) *int Value () ) ? 

elee if (fldType.eo^lBlgnoreCase 

Object val « fld.get (obj) ; 

if (val I- null) 

{ 

String eVal * (String) val i 

// value is to be intrepreted as in-line XML 

if (eVaa.startsWith^fxml") •« true) 

{ 

clientutilities ,XMLHelper helper « new 

clientutilitiee . XMLHelper ( ) ; 
clientutilities. IXMLEntity rootHode » 

helper . LoadDocuaent (sVal) ; 
// if it was xral, then add a (parsed) subnode 
If (rootKode l« null) 

{ 

node .AddSubnode (rootHode .GetEntity (f ldName)); 

} 

II otherwise the XML (string) is invalid. . .add a subnode 

and stick in a string 
else 

{ 

clientutilities. IXMLEntity eleNode « 
. node.AddBntity(fldWame) ; 
eleNode . se tStringvalue ( (String) val) ; 

) } 

// value is textual 
else 

{ 

clientutilities. IXMIfiatity eleNode « 

node.AddEntity(fldNama) \ 
eleNode . sets tringvalue ( (String) val ) ; 

} 

} 

else if (fldTyp«.eo^al8ZgnoreCase( ,, java.util.Z)ate*>)) 

Object val « fld.get (obj) ; 
if (val 1- null) 

( 

clientutilities. IXMLEntity eleNode m 
node.AddBntity(fldName) ; 
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OleDate od a new 01eDate{ (java.util. Date) val) ; 
^ eleNode . s etDateValue (od . toDouble ( ) ) f 

else if (fldType.equalsXgnoreCase(" IB")) 

{ 

if ( IncludeAttachments <m true) 
{ 

Object val * fid. get (obj) ; 
if (val !° mill) 

{ 

byte [) bao (byte [] ) val; 
int nelem & ba. length; 
if (nelem > 0) 
{ 

caientutil it ieB . IXMLBntity eleNode - 

node . AddEntity (f IdName) j 
com. ms.com. 6 afeArray sa » new 

com . tns . com . SafeArray (com . ins • com . Variant .Variant Byte 

, nelem) ; 
sa . f romByteArray (ba) ; 

com. roa .com. Variant va » new com. ms.com. Variant () ; 
va .putSaf eArray ( sa) ; 

if (tteeBLOBPiles mm false 1 1 nelem < minBLOBFil eSize ) 
{ 

eleNode. setValue(va) ; 
if (JLogLevel > 1) 

Log.Log("Ul B , "BLOB written to XML stream, size ■ ♦ 

nelem. 1335) j 

} 

else 
{ 

dientutilities .FileHelper fh « new 

dientutilities . FileHelper ( ) j 
if (fh I* null) 

{ 

String fName » th.GetUnigueFilename (_BLOBFilePath, 
"BLB") ; . 

fh.WriteFile (fName, va) ; 

el eNode . AddStringAttribute ( • File p , fName); 
if (_logLevel > 1) 

Log.LogCUl", "BLOB written to file, size » + ' 
nelem, 1345) ; 

} 

else 

lag.LocrCW", "Onmappable field type • + fldType, 1398); 

} 

} > 

// remove this parent from the processing stack 

parentNodeList . removeElement (obj . toS tring ( ) ) ; 
return true; 

} 

catch (Exception e) 

{ 

Log.Log( n E», "Exception: " + e.toStringO , 1184); 
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. return f aloe; 

) 
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WHAT IS CLAIMED IS: 

5 1 . A computer-implemented method for processing data from a chemical experiment 
on a library of materials, the method comprising: 

receiving data from a chemical experiment on a library of materials having a 
plurality of members; and 

generating a representation of the chemical experiment, the representation 
10 including data defining an experiment object having a plurality of properties derived from 
the chemical experiment, the experiment object being associated with the library of 
materials, the representation also including data defining one or more element objects, 
each element object being associated with one or more members of the library of 
materials. 

15 

2. The method of claim 1 , wherein: 

the chemical experiment has a type that is one of a pre-defined set of one or more 
experiment types; 

the representation implements a data model describing the one or more 

20 experiment types, the data model including an experiment base class having a set of 

experiment base class properties including a classname property for identifying a derived 
experiment class and a library ID property for identifying a library of materials, the data 
model also including one or more derived experiment classes, each derived experiment 
class being associated with one of the experiment types and having a plurality of derived 

25 experiment class properties derived from the associated experiment type; and 

the chemical experiment is represented in the representation by a first experiment 
object instantiated from the derived experiment class associated with the type of the 
chemical experiment, and by a second experiment object instantiated from the experiment 
base class, the classname property of the second experiment object having a value 

30 identifying the derived experiment class associated with the experiment type of the 

chemical experiment, and the library ID property of the second experiment object having 
a value identifying the library of materials. 



3. The method of claim 2, wherein: 
35 the representation includes data defining one or more data set objects, each data 

set object including data defining a set of values derived from the chemical experiment, 

48 



WO 02/054188 



PCT/US02/00466 



each value of the set of values being associated with one or more of the plurality of 
members of the library of materials, each data set object being associated with a property 
of the first experiment object. 

4. The method of claim 2, wherein: 

the representation includes data defining one or more image objects, each image 
object including data representing a state of at least a subset of the plurality of members 
of the library of materials at a time during the chemical experiment, each image object 
being associated with a property of the first experiment object. 

5. The method of claim 2, wherein: 

the representation includes a self-describing representation of the chemical 
experiment. 

6. The method of claim 2, wherein: 

the representation includes an XML stream, a Java object, a COM DDL interface 
or a CORBA IDL interface describing the chemical experiment. 

7. The method of claim 2, further comprising: 

parsing the representation to map the data from the chemical experiment to tables 
in a relational database based on the properties of at least the first experiment object. 

8. The method of claim 7, wherein: 

the representation includes an XML stream describing the chemical experiment; 

and 

parsing the representation includes identifying each of a plurality of XML entities 
in the XML stream, each entity having associated content; mapping each XML entity into 
a corresponding object property; and assigning the content associated with an XML entity 
to a database table based on the corresponding object property. 

9. The method of claim 2, wherein: 

the derived experiment class properties include one or more properties derived 
from one or more parameters of the associated experiment type. 
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5 1 0. The method of claim 3, wherein: 

at least one data set object includes data defining a set of values derived from a 
parameter of the chemical experiment; and 

the at least one data set object is associated with a property of the first experiment 
object derived from the parameter of the chemical experiment 

10 

1 1 . The method of claim 1 0, wherein: 

the set of values is a set of values measured for the parameter of the chemical 

experiment. 

15 12. The method of claim 8, further comprising: 

storing the content in the assigned database table in the relational database. 

1 3 . The method of claim 1 2, further comprising: 

receiving a query specifying search terms for one or more searchable data fields in 
20 the relational database; 

searching the relational database according to the query; and 

returning a search result including data identifying a set of one or more element 

objects satisfying the search terms of the query. 

25 1 4. The method of claim 13, further comprising: 

receiving an input specifying one or more displayable fields for display; and 
displaying one or more values associated with the specified displayable fields for 
each of the set of element objects satisfying the search terms of the query. 

30 15. The method of claim 13, further comprising: 

storing the search result as a list of element objects satisfying the search terms of 
the query. 

16. The method of claim 12, further comprising: 
35 receiving a database access request including an object identifier specifying 

content to be retrieved from the relational database; 

retrieving the specified content from a table in the relational database; 
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5 generating an object representation of the content based on a classname included 

in the specified content; and 

mapping the object representation to an XML stream describing the content 

17. A computer program product on a computer-readable medium for processing data 
10 from a chemical experiment on a library of materials, the computer program product 
comprising instructions operable to cause a programmable processor to: 

receive data from a chemical experiment on a library of materials having a 

plurality of members; and 

generate a representation of the chemical experiment, the representation including 
15 data defining an experiment object having a plurality of properties derived from the 

chemical experiment, the experiment object being associated with the library of materials, 
the representation also including data defining one or more element objects, each element 
object being associated with one or more members of the library of materials. 

20 18. The computer program product of claim 17, wherein: 

the chemical experiment has a type that is one of a pre-defined set of one or more 
experiment types; 

the representation implements a data model describing the one or more 
experiment types, the data model including an experiment base class having a set of 

25 experiment base class properties including a classname property for identifying a derived 
experiment class and a library ID property for identifying a library of materials, the data 
model also including one or more derived experiment classes, each derived experiment 
class being associated with one of the experiment types and having a plurality of derived 
experiment class properties derived from the associated experiment type; and 

30 the chemical experiment is represented in the representation by a first experiment 

object instantiated from the derived experiment class associated with the type of the 
chemical experiment, and by a second experiment object instantiated from the experiment 
base class, the classname property of the second experiment object having a value 
identifying the derived experiment class associated with the experiment type of the 

35 chemical experiment, and the library ID property of the second experiment object having 
a value identifying the library of materials. 

1 9. The computer program product of claim 1 8, wherein: 
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the representation includes data defining one or more data set objects, each data 
set object including data defining a set of values derived from the chemical experiment, 
each value of the set of values being associated with one or more of the plurality of 
members of the library of materials, each data set object being associated with a property 
of the experiment object. 



20. The computer program product of claim 1 8, wherein: 

the representation includes data defining one or more image objects, each image 
object including data representing a state of at least a subset of the plurality of members 
of the library of materials at a time during the chemical experiment, each image object 
15 being associated with a property of the experiment object. 

1 21. The computer program product of claim 1 8, wherein: 

the representation includes a self-describing representation of the chemical 
experiment. 

20 

22. The computer program product of claim 18, wherein: 

the representation includes an XML stream, a Java object, a COM IDL interface 
or a CORBA IDL interface describing the chemical experiment. 

25 23. The computer program product of claim 1 8, further comprising instructions 
operable to cause a programmable processor to: 

parse the representation to map the data from the chemical experiment to tables in 
a relational database based on the properties of at least the first experiment object. 

30 24. The computer program product of claim 23, wherein: 

the representation includes an XML stream describing the chemical experiment; 
and the instructions to parse the representation include instructions operable to cause a 
programmable processor to identify each of a plurality of XML entities in the XML 
stream, each entity having associated content; map each XML entity into a corresponding 

35 object property; and assign the content associated with an XML entity to a database table 
based on the corresponding object property. 



25. The computer program product of claim 1 8, wherein: 
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5 the derived experiment class properties include one or more properties derived 

from one or more parameters of the associated experiment type. 

26. The computer program product of claim 19, wherein: 

at least one data set object includes data defining a set of values derived from a 
10 parameter of the chemical experiment; and 

the at least one data set object is associated with a property of the first experiment 
object derived from the parameter of the chemical experiment. 

27. The computer program product of claim 26, wherein: 

1 5 the set of values is a set of values measured for the parameter of the chemical 

experiment. 

28. The computer program product of claim 24, further comprising instructions 
operable to cause a programmable processor to: 

20 store the content in the assigned database table in the relational database. 

29. The computer program product of claim 28, further comprising instructions 
operable to cause a programmable processor to: 

receive a query specifying search terms for one or more searchable data fields in 
25 the relational database; 

search the relational database according to the query; and 
return a search result including data identifying a set of one or more element 
objects satisfying the search terms of the query. 

30 30. The computer program product of claim 29, further comprising instructions 
operable to cause a programmable processor to: 

receive an input specifying one or more displayable fields for display; and 
display one or more values associated with the specified displayable fields for 
each of the set of element objects satisfying the search terms of the query. 

35 

3 1 . The computer program product of claim 29, further comprising instructions 
operable to cause a programmable processor to: 
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store the search result as a list of element objects satisfying the search terms of the 

query. 

32. The computer program product of claim 28, further comprising instructions 
operable to cause a programmable processor to: 

receive a database access request including an object identifier specifying content 
to be retrieved from the relational database; 

retrieve the specified content from a table in the relational database; 

generate an object representation of the content based on a classname included in 

the specified content; and 

map the object representation to an XML stream describing the content. 

33 . A computer-implemented laboratory data management system for processing data 
from a chemical experiment involving a library of materials, the system comprising: 

one or more client processes running on one or more computers coupled to a 
network, at least one of the client processes being operable to receive data from a 
chemical experiment on a library of materials having a plurality of members and generate 
a representation of the chemical experiment, the representation including data defining an 
experiment object having a plurality of properties derived from the chemical experiment, 
the experiment object being associated with the library of materials, the representation 
also including data defining one or more element objects, each element object being 
associated with one or more members of the library of materials; and 

a database server process running on a computer coupled to the network, the 
database server process being operable to receive the representation of the chemical 
experiment and to parse the representation to map the data from the chemical experiment 
to tables in a relational database stored in a memory coupled to the network based on the 
experiment object properties. 

34. The laboratory data management system of claim 33, wherein: 

the chemical experiment has a type that is one of a pre-defined set of one or more 
experiment types; 

the database server process implements a data model describing the one or more 

experiment types, the data model including an experiment base class having a set of 

experiment base class properties including a classname property for identifying a derived 
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5 experiment class and a library ID property for identifying a library of materials, the data 
model also including one or more derived experiment classes, each derived experiment 
class being associated with one of the experiment types and having a plurality of derived 
experiment class properties derived from the associated experiment type; and 

the representation includes a first experiment object instantiated from the derived 

10 experiment class associated with the type of the chemical experiment, and a second 

experiment object instantiated from the experiment base class, the classname property of 
the second experiment object having a value identifying the derived experiment class 
associated with the experiment type of the chemical experiment, and the library ID 
property of the second experiment object having a value identifying the library of 

15 materials. 

35. The laboratory data management system of claim 34, wherein: 

the representation includes data defining one or more data set objects, each data 
set object including data defining a set of values derived from the chemical experiment, 
20 each value of the set of values being associated with one or more of the plurality of 

members of the library of materials, each data set object being associated with a property 
of the first experiment object 

36. The laboratory data management system of claim 34, wherein: 

25 the representation includes data defining one or more image objects, each image 

object including data representing a state of at least a subset of the plurality of members 
of the library of materials at a time during the chemical experiment, each image object 
being associated with a property of the first experiment object. 

30 37. The laboratory data management system of claim 34, wherein: 

the representation includes a self-describing representation of the chemical 
experiment. 

38. The laboratory data management system of claim 34, wherein: 

35 the representation includes an XML data stream, a Java object, a COM IDL 

interface or a CORBA IDL interface describing the chemical experiment. 

39. The laboratory data management system of claim 34, wherein: 
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the representation includes an XML stream describing the chemical experiment; 

and 

the database server process is operable to identify each of a plurality of XML 
entities in the XML stream, each entity having associated content; map each XML entity 
into a corresponding object property; and assign the content associated with an XML 
entity to a database table based on the corresponding object property. 

40. The laboratory data management system of claim 34, wherein: 

the derived experiment class properties include one or more properties derived 
from one or more parameters of the associated experiment type. 

41 . The laboratory data management system of claim 35, wherein: 

at least one data set object includes data defining a set of values derived from a 
parameter of the chemical experiment; and 

the at least one data set object is associated with a property of the first experiment 
object derived from the parameter of the chemical experiment. 

42. The laboratory data management system of claim 41 , wherein: 

the set of values is a set of values measured for the parameter of the chemical 
experiment. 

43. The laboratory data management system of claim 39, wherein: 

the database server process is operable to store the content in the assigned 
database table in the relational database. 

44. The laboratory data management system of claim 43, wherein: 

the database server process is operable to receive a query specifying search terms 
for one or more searchable data fields in the relational database; search the relational 
database according to the query; and return a search result including data identifying a set 
of one or more element objects satisfying the search terms of the query. 

45. The laboratory data management system of claim 44, wherein: 

the database server process is operable to store the search result as a list of 

element objects satisfying the search terms of the query. 
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46. The laboratory data management system of claim 39, wherein: 

the database server process is operable to receive a database access request 
including an object identifier specifying content to be retrieved from the relational 
database; retrieve the specified content from a table in the relational database; 

generate an object representation of the content based on a classname included in 
the specified content; and map the object representation to an XML stream describing the 
content. 

47. A laboratory data management system for processing data from a chemical 
experiment involving a library of materials, the system comprising: 

means for receiving data from a chemical experiment on a library of materials 
having a plurality of members, the chemical experiment having a type that is one of a pre- 
defined set of one or more experiment types; 

means for generating a first representation of the chemical experiment, the first 
representation implementing a data model describing the one or more experiment types, 
the data model including an experiment base class having a set of experiment base class 
properties including a classname property for identifying a derived experiment class and a 
library ID property for identifying a library of materials, the data model also including 
one or more derived experiment classes, each derived experiment class being associated 
with one of the experiment types and having a plurality of derived experiment class 
properties derived from the associated experiment type, the first representation including 
data defining a first experiment object instantiated from the derived experiment class 
associated with the type of the chemical experiment, the first representation also 
including data defining a second experiment object instantiated from the experiment base 
class, the classname property of the second experiment object having a value identifying 
the derived experiment class associated with the experiment type of the chemical 
experiment, and the library ID property of the second experiment object having a value 
identifying the library of materials; 

means for parsing the first representation to map the data from the chemical 
experiment to tables in a relational database based on the properties of at least the first 
experiment object; and 
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5 means for generating, in response to a database access request, a second 

representation of the chemical experiment from data stored in the relational database 
based solely on data stored in the relational database. 

48. A laboratory data management system for processing data from a chemical 
10 experiment involving a library of materials, the system comprising: 

one or more client processes running on one or more computers coupled to a 
network, each client process being operable to generate a data set including a value for 
each of a plurality of members of a library of materials and to generate an intermediate 
representation of the data set, the intermediate representation including data defining an 
15 experiment object having a plurality of properties derived from the chemical experiment, 
the plurality of properties including a data set object assigning each first data set value to 
one of a set of elements corresponding to a plurality of members of the library of 
materials; 

a middle-tier server process running on a computer coupled to the network, the 
20 middle-tier server process being operable to receive the intermediate representation and to 
parse the intermediate representation to map the element values to database tables 
according to a predetermined database schema; and 

a database process running on a computer coupled to the network, the database 
process being operable to store the element values in a memory coupled to the network. 

25 

49. A computer-implemented data model for describing data from a set of pre-defined 
types of chemical experiments capable of being performed on a library of materials, the 
data model comprising: 

an experiment base class having a plurality of experiment base class properties 
30 including a classname property for identifying a derived experiment class and a library ID 
property for identifying a library of materials; 

one or more derived experiment classes, each derived experiment class being 
associated with one of the set of pre-defined types of chemical experiments capable of 
being performed on the library of materials and having a plurality of derived experiment 
35 class properties derived from the associated experiment type; and 

an element class having a plurality of element class properties including a position 
property for identifying one or more members of a library of materials and a value 
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5 property for storing a value derived from a chemical experiment for the members 
identified by the position property; wherein: 

a specific experiment in the set of pre-defined experiments is represented by a first 
experiment object instantiated from the derived experiment class associated with the type 
of the chemical experiment, and by a second experiment object instantiated from the 

10 experiment base class, the classname property of the second experiment object having a 
value identifying the derived experiment class associated with the experiment type of the 
chemical experiment, and the library ID property of the second experiment object having 
a value identifying the library of materials. 



15 50. A data structure for processing data from a chemical experiment involving a 
library of materials, the chemical experiment having a type selected from a set of pre- 
defined experiment types, the data structure comprising: 

a first experiment object representing the chemical experiment, the first 
experiment object being instantiated from an experiment base class and having a plurality 
20 of experiment base class properties including a classname property identifying a derived 
experiment class corresponding to the type of the chemical experiment, and a library ID 
property identifying the library of materials; 

a second experiment object representing the chemical experiment, the second 
experiment object being instantiated from the derived experiment class corresponding to 
25 the type of the chemical experiment, the second experiment object having a plurality of 
derived experiment class properties derived from the corresponding experiment type; 

one or more element objects, each element object being associated with one or 
more members of the library of materials; and 

one or more data set objects, each data set object including a set of values derived 
30 from the chemical experiment, each value of the set of values being associated with one 
or more element objects, each data set object being associated with a derived experiment 
class property of the second experiment object. 



5 1 . The method of claim 1 , further comprising: 

parsing the representation to map a subset of the data from the representation to 
tables in a relational database based on the properties of the experiment object. 



52. The method of claim 5 1 , wherein: 
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parsing the representation includes mapping a first subset of the data from the 
representation to a first set of database tables based on a first mapping schema and a 
second subset of the data from the chemical experiment to a second set of database tables 
based on a second mapping schema, the first mapping schema being different from the 
second mapping schema. 

53. The method of claim 52, wherein: 

the first subset of the data is different from the second subset of the data. 

54. The method of claim 1 , further comprising: 

parsing the representation to map the data from the representation to a first set of 
database tables based on a first mapping schema and a second set of database tables based 
on a second mapping schema, the first set of database tables being different from the 
second set of database tables. 

55. - The computer program product of claim 17, further comprising instructions 
operable to cause a programmable processor to: 

parse the representation to map a subset of the data from the representation to 
tables in a relational database based on the properties of the experiment object. 

56. The computer program product of claim 55, wherein: 

the instructions operable to cause a programmable processor to parse the 
representation include instructions operable to cause a programmable processor to map a 
first subset of the data from the representation to a first set of database tables based on a 
first mapping schema and a second subset of the data from the chemical experiment to a 
second set of database tables based on a second mapping schema, the first mapping 
schema being different from the second mapping schema. 

57. The computer program product of claim 56, wherein: 

the first subset of the data is different from the second subset of the data. 

58. The computer program product of claim 1 7, further comprising instructions 
operable to cause a programmable processor to: 
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parse the representation to map the data from the representation to a first set of 
database tables based on a first mapping schema and a second set of database tables based 
on a second mapping schema, the first set of database tables being different from the 
second set of database tables. 

59. The laboratory data management system of claim 33, wherein: 

the database server process is operable to map a subset of the data from the 
representation to tables in a relational database based on the experiment object properties. 

60. The laboratory data management system of claim 59, wherein: 

the database server process is operable to map a first subset of the data from the 
representation to a first set of database tables based on a first mapping schema and a 
second subset of the data from the chemical experiment to a second set of database tables 
based on a second mapping schema, the first mapping schema being different from the 
second mapping schema. 

61 . The laboratory data management system of claim 60, wherein: 

the first subset of the data is different from the second subset of the data. 

62. The laboratory data management system of claim 33, wherein: 

the database server process is operable to map the data from the representation to 
a first set of database tables based on a first mapping schema and a second set of database 
tables based on a second mapping schema, the first set of database tables being different 
from the second set of database tables. 

63. The laboratory data management system of claim 48, wherein: 

the middle-tier server process is operable to map the element values from the 
intermediate representation to a first set of database tables according to a first 
predetermined database schema and a second set of database tables according to a second 
predetermined database schema, the first set of database tables being different from the 
second set of database tables. 
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ing an experiment object having a plurality of properties derived from 
the chemical experiment. The experiment object is associated with the 
library of materials. The representation also includes data defining one 
or more element objects (250). Each element object is associated with 
one or more members of the library of materials (260). A data model 
and corresponding data structures for describing such experiments are 
also disclosed. 



1 


> 


RECEIVE EXPERIMENTAL 
DATA BET 


> 




PARSE DATA SET ^ 






GENERATE INTERMEDIATE 
REPRESENTATION 






SEND INTERMEDIATE 
REPRESENTATION TO 
DATABASE SERVER 
PROCESS 






EXTRACT OBJECT 
INFORMATION 


i 


* 


MAP OBJECT W FORMATION 
TO DATABASE 


1 


> 



210 



220 



<-240 



280 



260 



WO 02/054188 A3 



Eurasian patent (AM, AZ, BY, KG, KZ, MD, RU, TJ, TM), (88) Date of publication of the international search report: 
European patent (AT, BE, CH, CY, DE, DK, ES, FI, FR, 6 March 2003 

GB, GR, IE, IT, LU, MC, NL, PT, SE, TR), OAPI patent 
(BF, BJ, CF, CG, CI, CM, GA, GN, GQ, GW, ML, MR, 

NE, SN, TD, TG). . . 

For two-letter codes and other abbreviations, refer to the Uuia- 

Published: <"ice Mtfes on Codes and Abbreviations" appearing at the begin- 

— with international search report ning of each regular issue of the PCT Gazette. 



• 



INTERNATIONAL SEARCH REPORT 



International application No. 
PCT/US02/OO466 



A CLASSIFICATION OF SUBJECT MATTER 
IPq7) :G06F 17/30 

US CL : 707/6, 104; 435/6; 436/518, 548, 808 
According to International Patent Classification (IPC) or to both national classification and IPC 



B. FIELDS SEARCHED 



Minimum documentation searched (classification system followed by classification symbols) 
U.S.: 707/6, 104; 435/6; 436/518, 548, 808 

Documentation searched other than minimum documentation to the extent that such documents are included in the fields 
searched 



Electronic data base consulted during the international search (name of data base and, where practicable, search terms used) 
WEST USPATFULL 



G DOCUMENTS CONSIDERED TO BE RELEVANT 



Category* 



Citation of document, with indication, where appropriate, of the relevant passages 



Relevant to claim No. 



US 5,920,871 A (MARCRI et al) 06 July 1999 (07.06.1999) see 
whole document 

US 6,185,561 Bl (BALABAN et al) 06 February 2001 (02.06.2001) 
see whole document 



1-63 



1-63 



Q Further documents are listed in the continuation of Box C. [_J See patent family annex. 



"ET 
«0" 



Special categories of cited documents: 

document defining the general state of the art which is not 
considered to be of particular relevance 

earlier document published on or after the international filing date 

document which may throw doubts on priority claim(s) or which is 
cited to establish the publication date of another citation or other 
special reason (as specified) 

document referring to an oral disclosure, use, exhibition or other 
means 

document published prior to the international filing date but later 
than the priority date claimed 



"T* later document published after the international filing date or priority 

date and not in conflict with the application but cited to understand 
the principle or theory underlying the invention 

"X" document of particular relevance; the claimed invention cannot be 

considered novel or cannot be considered to involve an inventive step 
when the document is taken alone 

"Y w document of particular relevance; the claimed invention cannot be 

considered to involve an inventive step when the document Is 
combined with one or more other such documents, such combination 
being obvious to a person skilled in the art 

document member of the same patent family 



Date of the actual completion of the international search 



06 APRIL 2002 



Name and mailing address of the ISA/US 



Date of mailing of the international search report 

0 3 DEC 2002 



Authorized officer 
Telephone No. (703) 



70S) 305-8355 



